(54) Title: ANALYSIS METHOD



(57) Abstract: This invention relates to novel methods for the identification of genes and gene products that are implicated in
certain disease states. According to the invention, there is provided a method for the identification of a gene that is implicated in a
specific disease or physiological condition, said method comprising the steps of comparing: i) the transcriptome or proteome of a
first specialised cell type that is implicated in the disease or condition under first and second experimental conditions; with ii) the
transcriptome or proteome of a second specialised cell type under said first and said second experimental conditions; and identifying
as a gene implicated in the disease or physiological condition, a gene that is differentially regulated in the two specialised cell types
under the first and second experimental conditions. The invention also relates to novel genes and gene products identified using these
methods.
Analysis method

This invention relates to novel methods for the identification of genes and gene products that are 
implicated in certain disease states. The invention also relates to novel genes and gene products identified 
using these methods. 

All publications, patents and patent applications cited herein are incorporated in full by reference.

One of the central goals in the field of gene expression is to understand and elucidate the relationship 
between a particular disease state and the gene expression pattern that defines and/or causes this disease 
state. Research has concentrated on differences in expression patterns between diseased and healthy 
tissues to elucidate the physiological mechanisms of disease. Identified differences in expression patterns 
provide putative points for therapeutic intervention to reverse the disease phenotype. These differences
also provide markers that are useful for diagnosis, and identify proteins for further investigation as agents 
implicated in the disease in question. 

Conventional methods for the elucidation of mechanisms of disease tend to concentrate on the correlation 
of a disease state with altered levels of a particular protein. Such methods include techniques of
immunohistochemistry, the study of differential mRNA expression and the sequence analysis of particular
proteins to identify mutations that are associated with a certain disease state. 

Recently, research has concentrated on analysis of the transcriptomes of organisms and cell types that are
considered to be of scientific interest. By "transcriptome" is meant the exact set of transcripts that are
expressed in a cell. The emerging field of nucleic acid arrays is one field in which a large number of
powerful tools are being generated for the study of transcriptome variation between different tissue types.
These tools are based on techniques originally pioneered by Schena et al., 1995 (Science 270: 467-470)
and Fodor et al., 1991 (Science 251: 767-773). They facilitate the evaluation of variations in DNA or RNA
sequences and of variations in expression levels from tissue samples, and allow the identification and
genotyping of mutations and polymorphisms in these sequences. The power of one such technique has
recently been demonstrated by Perou et al. (Nature, 2000, 406:747-752), who generated molecular
portraits of the transcriptomes of human breast tumours.

Over recent years, the so-called "genomics revolution" has allowed access to large portions of whole 
genomes, including the human genome. The amount of sequence information now available considerably 
facilitates the analysis of the results of experiments that aim to elucidate the differences between gene 
expression in diseased and healthy tissues. As this information increases in scope and becomes more
readily available, the study of the molecular mechanism of disease, and the elucidation of techniques for 
combatting these diseases will be considerably facilitated. 

However, there are notable disadvantages associated with all methods that are currently employed for the
analysis of human disease. Many methods currently employed utilise established cell lines. Because these
cells have been manipulated to allow their immortalisation in cell culture, the physiological situation in
these cells is not considered by the present inventors to be generally representative of the authentic
situation in equivalent cells in vivo. Furthermore, most of these methods tend to utilise a global strategy
for intervention, often ignoring the intricacies in gene expression that exist between different tissues.
There thus remains a great need for the establishment of novel methods for the analysis of gene
expression.

According to the invention, there is provided a method for the identification of a gene that is implicated in 
a specific disease or physiological condition, said method comprising the steps of: 

a) comparing:

i) the transcriptome or proteome of a first specialised cell type that is implicated in the
disease or condition under first and second experimental conditions; with

ii) the transcriptome or proteome of a second specialised cell type under said first and
said second experimental conditions; and

b) identifying as a gene implicated in the disease or physiological condition, a gene that is
differentially regulated in the two specialised cell types under the first and second
experimental conditions.

Using this method, genes have been identified that respond to perturbations of cell physiology in a cell- 
specific rather than a generic fashion. The method of the invention exhibits significant advantages over 
conventional methods of identifying genes that are implicated in disease.

Various groups have previously investigated mechanisms of physiological regulation, by comparing gene 
expression levels in the presence and absence of a physiological stimulus or challenge. Genes identified 
in a particular cell type as being expressed at different levels under different conditions are implicated as 
components of a pathway that is responsive to the altered conditions, or that is regulated differently under 
the altered conditions. However, these methods exhibit a tendency to ignore patterns of gene expression
that are physiologically relevant. This inclination is considered to result from a prejudice in the art that
dictates that cells respond to changes in certain physiological conditions in a generic fashion, rather than
in a cell-specific fashion.



By "implicated in a specific disease or physiological condition" is meant that the gene has been found to
possess a distinct role in a pathway that is involved in susceptibility to, generation of or maintenance of a
particular disease phenotype or physiological condition. As will be apparent to the skilled reader, any
point in any pathway may be the unique point at which a cell departs from the normal physiological
response and generates a disease phenotype. Often the effect that is manifested as a disease is the result of
a mutation event, in which a mutation occurs in the sequence of a gene encoding a protein that functions
in a relevant physiological pathway.

There are numerous examples of diseases and conditions that may be studied using the method of the
invention. Such pathological conditions include those that result from a change in the intrinsic nature of a
cell (usually genetic) or from a change in the cellular microenvironment, either of which might be
recapitulated in a laboratory setting. The methods may be applied to any disease or condition that is
manifested in, or is generated in, a specific cell type.

Examples of such conditions include changes in the cellular microenvironment, exposure to hormones,
growth factors, cytokines, chemokines, inflammatory agents, toxins, metabolites, pH, pharmaceutical
agents, hypoxia, anoxia, ischemia, imbalance of any plasma-borne nutrient [including glucose, amino
acids, co-factors, mineral salts, proteins and lipids], osmotic stress, temperature [hypothermia and
hyperthermia], mechanical stress, irradiation [ionising or non-ionising], cell-extracellular matrix interactions,
cell-cell interactions, accumulations of foreign or pathological extracellular components, intracellular and
extracellular pathogens [including bacteria, viruses, fungi and mycoplasma] and genetic perturbations
[either epigenetic or mediated by mutation or polymorphism].

Examples of such diseases include cardiovascular disease, atherosclerosis, inflammatory conditions 
(including rheumatoid arthritis), cancer, ischemic disease, asthma, hematopoietic disorders, neurological 
diseases including Parkinson's and Alzheimer's diseases, infectious disease and allergies.

One particular physiological response that has been used herein to illustrate the invention is the cellular
response to hypoxia. The term "hypoxia" is intended to refer to an environment of reduced oxygen
tension, as compared to the normal physiological environment for a particular organism, which is termed
"normoxia". The prejudice in this technical field presents the view that there is a general, ubiquitous
response to hypoxia, mediated primarily at the level of mRNA (transcriptional initiation and
post-transcriptional stabilisation).

In a variety of human diseases, cells are exposed to conditions of low oxygen tension, usually as a result
of poor oxygen supply to the diseased area. For instance, tissue oxygenation plays a significant regulatory
role in both apoptosis and in angiogenesis (Bouck et al., 1996, Adv. Cancer Res. 69:135-174; Bunn et al.,
1996, Physiol. Rev. 76:839-885; Dor et al., 1997, Trends Cardiovasc. Med. 7:289-294; Carmeliet et al.,
1998, Nature 394:485-490). Apoptosis (see Duke et al., 1996, Sci. American, 80-87 for review) and
growth arrest occur when cell growth and viability are reduced due to oxygen deprivation. Angiogenesis
(i.e. blood vessel growth, vascularization) is stimulated when hypooxygenated cells secrete factors that
stimulate proliferation and migration of endothelial cells in an attempt to restore oxygen homeostasis (for
review see Hanahan et al., 1996, Cell 86:353-364).

Ischaemic disease pathologies involve a decrease in the blood supply to a bodily organ, tissue or body
part generally caused by constriction or obstruction of the blood vessels. For example, solid tumours
typically have a disorganised blood supply, leading to hypoxic regions. Other disease conditions
involving hypoxia include stroke, atherosclerosis, retinopathy, acute renal failure, myocardial infarction
and hair loss. Therefore, apoptosis and angiogenesis as induced by the ischaemic condition are also
considered to be involved in these disease states. It is generally considered that understanding the
mechanism by which cells respond to these diseases may be the key to the disease pathology and thus
relevant to disease treatment.

In a different but related approach, it is now recognised that angiogenesis is necessary for tumour growth
and that retardation of this process provides a useful tool in controlling malignancy and retinopathies. For
example, neoangiogenesis is seen in many forms of retinopathy and in tumour growth. The ability to
induce tumourigenic cells to undergo apoptosis is an extremely desirable goal; particularly in the
cancer field, it has been observed that apoptosis and angiogenesis-related genes provide potent
therapeutic targets. It has also been observed that hypoxia plays a critical role in the selection of
mutations that contribute to more severe tumourigenic phenotypes (Graeber et al., 1996, Nature
379(6560):88-91).

Early in the history of this field it was discovered that a transcription factor, HIF-1alpha, is ubiquitously
present in cells and is responsible for the induction of a number of genes in response to hypoxia. This
protein is considered a master regulator of oxygen homeostasis (see, for example, Semenza (1998) Curr.
Op. Genetics and Dev. 8:588-594). Where HIF-1alpha is genetically knocked out, the hypoxia-inducible
transcription of virtually all glycolytic enzymes has been shown to be inhibited. Glycolysis is an essential
process which goes on in all mammalian cells. This finding is therefore consistent with previous work
showing that when cells are exposed to conditions of hypoxia, they up-regulate glycolytic enzymes to
enable ATP production, since oxidative phosphorylation is no longer feasible under conditions of low
oxygen (Webster (1987) Mol. Cell. Biochem. 11: 19-28). Further support for a critical and general role of
HIF-1alpha in the hypoxic response is demonstrated by the knockout mouse, which dies at day 10.5 of
gestation. The same is true of the knockout of the ARNT protein, the dimerisation partner of HIF-1alpha.

For the first time, it is demonstrated herein that different tissues and cell types exhibit a very different
response to hypoxia, at the level of the induction and repression of gene expression. This has allowed the
detailed elucidation of the mechanism of this particular physiological response, so paving the way for the
development of improved therapeutic agents that target components of the response pathway in particular
tissues. Although conventional approaches to the analysis of this mechanism have successfully identified
numerous genes, because of the universal prejudice in the art that these components will all be
induced/repressed similarly in all cell types, all the approaches suggested have hitherto been limited to the
design of therapeutic agents that act in a global fashion.

The methods of the present invention therefore extend and add to previous work performed in this field,
in that the discoveries made now allow the design of agents that target the hypoxic response in specific
tissues. For example, it is known that brain and heart tissues die very rapidly after ischaemic insult. By
using the method of the invention, it is quite possible that these tissues will be found to share common
features in their response to hypoxia that are different from those of other cell types. This might allow, for
example, the design of a combination cardioprotective and neuroprotective agent effective against this
subset of body tissues. Alternatively, the hypoxic response in these tissues might be found to be quite
different. This information would then be taken into account when designing therapeutic
countermeasures, in that an agent would be designed for the unique neurological or cardiological tissue
concerned.

The method of the invention involves the comparison of the transcriptomes or proteomes of at least two 
specialised cell types under two different physiological or genetic conditions. By "transcriptome" is 
meant the exact set of transcripts that are expressed in a cell. The transcriptome thus has a qualitative 
element (the identity of individual gene transcripts) and a quantitative element (the proportion of each
unique transcript in the total number of individual transcripts present in the cell at a particular moment). 
By "proteome" is meant the exact set of protein molecules that are expressed in a cell. 
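
The two elements of the transcriptome defined above can be illustrated with a minimal sketch. It assumes, purely for illustration, that a snapshot of a cell is available as a list in which each entry is one observed transcript labelled by its gene; the gene labels and the function name `transcript_proportions` are hypothetical and do not form part of the described method.

```python
from collections import Counter


def transcript_proportions(transcript_ids):
    """Return {gene: fraction of all transcripts} for one cell snapshot.

    The keys give the qualitative element (which transcripts are present);
    the values give the quantitative element (the proportion of each).
    """
    counts = Counter(transcript_ids)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {gene: n / total for gene, n in counts.items()}


# Hypothetical snapshot: one list entry per observed transcript.
snapshot = ["ACTB", "ACTB", "ACTB", "VEGF", "GAPDH", "GAPDH", "ACTB", "VEGF"]
profile = transcript_proportions(snapshot)
```

In this sketch `set(profile)` corresponds to the identity of the transcripts, while the values of `profile` sum to one and give the relative abundance of each transcript at the moment of sampling.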

By "specialised cell type" is meant a cell type that has a restricted biochemical capacity and that can be 
unambiguously identified as possessing a unique set of biochemical and physiological functions.
Preferably, the specialised cells are primary cells, and not cell lines or whole body tissues. Primary cells
are cells that cannot proliferate indefinitely in culture. Primary cells can be derived from adult tissue, or 
from embryo tissue that is differentiated in culture to an adult cell or to a precursor of an adult cell that 
displays specialised characteristics. 

Examples of preferred specialised cell types include cardiomyocytes, endothelial cells, sensory neurons,
motor neurons, CNS neurons (all types), astrocytes, glial cells, Schwann cells, mast cells, eosinophils,
smooth muscle cells, skeletal muscle cells, pericytes, lymphocytes, tumour cells, monocytes, macrophages,
foamy macrophages, granulocytes, synovial cells / synovial fibroblasts, epithelial cells (varieties from all
tissues/organs). Examples of other suitable specialised cell types include vascular endothelial cells,
smooth muscle cells (aortic, bronchial, coronary artery, pulmonary artery, etc), skeletal muscle cells,
cardiomyocyte cells, fibroblasts (many types, such as synovial), keratinocytes, hepatocytes, dendritic
cells, astrocytes, neurone cells (including mesencephalic, hippocampal, striatal, thalamic, hypothalamic,
olfactory bulb, substantia nigra, locus coeruleus, cortex, dorsal root ganglia, superior cervical ganglia,
sensory, motor, cerebellar cells), neutrophils, eosinophils, basophils, mast cells, monocytes, macrophage
cells, erythrocytes, megakaryocytes, hematopoietic progenitor cells, hematopoietic pluripotent stem cells,
any stem cells, any progenitor cells, epithelial cells, melanocytes, osteoblasts, osteoclasts, stromal cells,
Purkinje cells, T-cells, B-cells, synovial cells, pancreatic islet cells (alpha and beta), leukemia cells,
lymphoma cells, tumour cells, retinal cells, adrenal chromaffin cells. As will be apparent to the skilled
reader, it is not here possible to provide an exhaustive list of specialised cell types that may be studied
according to the methods of the present invention.

Intended as being included within the method of the invention is the possibility of using, as two different 
specialised cell types, two different physiological states of the same cell type, for example, activated and 
resting macrophages.

The transcriptomes of the specialised cell types are compared under different experimental conditions. 
The term "experimental conditions" is used broadly in this context and is intended to embrace any 
physiological or genetic conditions to which a cell type may be exposed. The intention of the method is to 
compare the transcriptomes or proteomes of the cell types under different experimental conditions that 
have a physiological relevance. Accordingly, the state of the transcriptome or proteome under one set of
experimental conditions will generally act as a control against which the transcriptome or proteome may 
be compared under a second set of experimental conditions. Any distinct physiologically-relevant 
conditions may therefore be of interest. 

Examples of suitable physiological experimental conditions include conditions under which the cell is 
submitted to a physiological, mechanical, temperature, chemical, toxic or pharmaceutical stress. One
example is hypoxia, defined herein as a physiological state in which oxygen demand by the cell exceeds 
its supply to the cell. The transcriptome or proteome under this set of experimental conditions may be 
compared to the transcriptome or proteome under conditions of normoxia, when oxygen supply is in 
concordance with the demand by the cell. 

The transcriptomes or proteomes may also be compared under different genetic conditions. By "genetic
conditions" is meant that the genotype of the compared cell populations contains a different genetic
component. This may be the presence of one or more different, non-endogenous nucleic acid molecules in
the cell, herein referred to collectively as "genetic elements". Such genetic element(s) may potentially be
incorporated into the genome of the cell, or alternatively may exist as a separate genetic entity, for
example, as an extra-chromosomal element such as a plasmid or episome. Alternatively, the genome may
have been perturbed by external intervention, for example, to increase or decrease the expression of a
particular gene or genes. A number of variations on this theme are possible, including the overexpression
of a genetic element via the administration of the functional gene, the overexpression of a genetic element
via the administration of a regulator of the functional gene (such as, for example, a transcription factor
[either natural or artificially constructed via the fusion of a DNA binding domain with an activator
domain]), the inhibition of the expression of a functional gene (for example, using antisense RNA or
ribozymes), the inhibition of the expression of a functional gene (for example, using a transdominant
protein) and the inhibition of the expression of a functional gene (for example, using a repressor protein
that is either natural or artificially constructed from a DNA binding protein fused to a repressor domain).

A particular example of a genetic perturbation as envisaged herein, that forms one preferred embodiment
of the method of the present invention, is the so-called "Smartomics" technology that forms the basis for
co-pending, co-owned International patent application PCT/GB01/00758. According to this technology, a
heterologous nucleic acid is introduced into a primary cell to augment a specific natural physiological
response. "Smartomics" may be applied to the current invention by measuring and comparing cellular
responses to a heterologous gene in two or more distinct cell types, both with and without the natural
physiological stimulus. Lentivirus technology is used to introduce the heterologous nucleic acid molecule
in such a way that there is negligible perturbation of endogenous gene expression. For this reason, this
technology exhibits significant benefits over conventional technology of a similar nature, since the prior
art methods are generally invasive, having downstream effects other than the simple introduction of the
heterologous nucleic acid molecule. The Smartomics technology allows much more precise
measurements to be taken of the effect of introducing the heterologous nucleic acid.

The method of the invention allows the identification of genes that are implicated in a specific disease or
physiological condition. The genes identified in this way are candidate targets for antagonists or agonists
that modulate disease states pertinent to that specialised cell type. This allows the development of
selective agonists and antagonists, rather than broad spectrum agonists and antagonists. This approach
thus adds value in the selective treatment of disease. Furthermore, the identified genes are associated with
regulatory elements that provide alternative and additional candidate targets for exploitation for the
delivery of gene products to that cell in a cell-specific fashion. The genes and regulatory elements
identified according to the method of the invention can be used directly in therapeutic applications via
gene therapy, via recombinant protein methods or via chemical mimetics or as targets for the
development of agonists and antagonists such as antibodies, small chemical molecules, peptides and
regulatory nucleic acids.

The step of comparison of the transcriptomes or proteomes of the first and second specialised cell types
under first and second experimental conditions may be effected using any approach that allows the
quantitative comparison of gene or protein expression, and a number of such means will be known to
those of skill in the art. Such experiments have only become possible in recent years, due to certain
advances in technology that have allowed the large scale, high throughput analysis of gene expression.

One example of a method that allows the comparison of the transcriptome of a specific cell type with a 
second or subsequent transcriptome involves the generation of a set of clones that represent all the 
transcripts expressed in a cell under the conditions in which the cell is maintained. This may be done by
constructing a cDNA library, in which copies of all mRNA transcripts expressed in the cell are cloned 
into a suitable vector for subsequent analysis. 

Such libraries may be normalised cDNA libraries, in which the distribution of genes in the library has 
been biased to reduce the number of clones that represent genes with large numbers of transcripts (such 
as, for example, beta-actin) and thus reduce the repetitive nature of the library. Normalisation thus acts to
reduce the frequency of genes expressed at high levels and to enhance the frequency of genes expressed at
low levels (see de Fatima Bonaldo et al., Genome Research 6: 791-806 (1996)).

Libraries may also be subtracted cDNA libraries, in which the distribution of genes is manipulated to 
remove genes that are expressed in both mRNA populations used to construct the library. The
commercially-available PCR Select kit (Clontech, Inc) is an example of a system useful to generate such
libraries. 

cDNA clones generated as reflective of the transcriptome of a specific cell type may then be amplified, 
and processed to evaluate the identity of the nucleic acid clones. For example, multiple clones may be 
picked and used as template for PCR amplification. The PCR products may then be arrayed onto 
membranes or glass slides to create nucleic acid arrays. For expression profiling, these arrays are then
hybridised to complex nucleic acid probes in order to quantitate the abundance of individual genes 
contained in the probes. 

A recent summary of nucleic acid array technology that is useful in the analysis of the transcriptome of a
cell population is provided in Nature Genetics (1999) (21 suppl: 1-61). There are various types of array
technology currently used, including "microarrays", or "chips", which are high density cDNA arrays
produced on glass slides, often produced using photolithography. A second type of array is the
"macroarray", which is an array with sub-millimetre spot-spot distances produced on a nylon membrane.
Examples of this type of array are the nylon-based arrays sold commercially by Research
Genetics Inc. (termed Research Genetics Human GeneFilters), each of which contains 5,300 cDNA fragments of
known identity. The whole series of arrays covers some 35,000 cDNA fragments. This particular array
system (and others like it) allows the identification of transcripts that are down-regulated, as well as those
that are up-regulated, since the range of genes used to manufacture the arrays is not biased.

The step of comparison may be effected by utilising subtracted cDNA libraries. Using this approach, the 
transcriptome of one specialised cell type under first experimental conditions is subtracted against the 
transcriptome under second experimental conditions. This reveals the differences in expression under the 
two experimental conditions tested. When this is performed for both specialised cell types, the differential
regulation of gene expression under the two experimental conditions is revealed. 
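
The logic of performing the subtraction for both specialised cell types may be sketched as follows. This is an idealised illustration only: it assumes that each transcriptome is reduced to a set of gene identifiers, so that subtraction behaves as a clean set difference, whereas a real subtracted library enriches for differentially expressed sequences rather than removing shared ones perfectly. The gene identifiers and function names are hypothetical.

```python
def subtract(tester, driver):
    """Idealised subtraction: genes present in tester but absent from driver."""
    return set(tester) - set(driver)


def cell_specific_responses(a_cond1, a_cond2, b_cond1, b_cond2):
    """Compare the condition-2 responses of cell types A and B.

    Each argument is a collection of gene identifiers detected in that
    cell type under that experimental condition.
    """
    induced_a = subtract(a_cond2, a_cond1)  # response of cell type A
    induced_b = subtract(b_cond2, b_cond1)  # response of cell type B
    return {
        "only_in_A": induced_a - induced_b,
        "only_in_B": induced_b - induced_a,
        "shared": induced_a & induced_b,
    }


# Hypothetical gene sets for two cell types under two conditions.
a_normoxia = {"g1", "g2"}
a_hypoxia = {"g1", "g2", "g3", "g4"}
b_normoxia = {"g1", "g5"}
b_hypoxia = {"g1", "g5", "g4", "g6"}

result = cell_specific_responses(a_normoxia, a_hypoxia, b_normoxia, b_hypoxia)
```

Under these assumptions, the genes in `only_in_A` and `only_in_B` are those that respond in a cell-specific fashion, while `shared` holds the genes that respond in both cell types.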

The step of comparison is effected through the detection of genes that are differentially regulated in the two
specialised cell types examined under the first and second experimental conditions. As an example, a
human cardiomyoblast (cell type A) and a human macrophage (cell type B) may be placed at the same
temperature and at a high oxygen tension (first experimental conditions [1]). Cells from the same cell
types are also incubated at this temperature, yet under conditions of low oxygen tension (second
experimental conditions [2]). In this simple example, there are then a minimum of four combinations of
cell type and condition, A[1], B[1], A[2] and B[2]. "Snapshots" are taken of the transcriptomes of both
cell types under the "normoxic" and the "hypoxic" experimental conditions, by preparing messenger
RNA from all four combinations. Differences in the regulation of genes can then be analysed, for
example, using a process of subtractive hybridisation.
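
Where quantitative expression values are available for all four combinations, one way of flagging differential regulation is to compare the change between conditions in cell type A with the change in cell type B. The following is a minimal sketch under stated assumptions: the expression values, the pseudocount of 1, the threshold of one log2 unit and the gene labels are all illustrative choices that are not taken from the description, and the sketch omits replicates and statistical testing.

```python
import math


def log2_fold_change(cond1, cond2, pseudo=1.0):
    """log2 ratio of condition 2 over condition 1, with a pseudocount."""
    return math.log2((cond2 + pseudo) / (cond1 + pseudo))


def differentially_regulated(expr, threshold=1.0):
    """Return genes whose response to the condition differs between cell types.

    expr maps gene -> {"A1": .., "A2": .., "B1": .., "B2": ..}, one value
    per combination of cell type (A, B) and condition (1, 2).
    """
    hits = {}
    for gene, v in expr.items():
        lfc_a = log2_fold_change(v["A1"], v["A2"])
        lfc_b = log2_fold_change(v["B1"], v["B2"])
        if abs(lfc_a - lfc_b) >= threshold:
            hits[gene] = (lfc_a, lfc_b)
    return hits


# Hypothetical expression values for the four combinations.
expr = {
    "gene_x": {"A1": 7, "A2": 31, "B1": 7, "B2": 7},   # induced in A only
    "gene_y": {"A1": 3, "A2": 15, "B1": 3, "B2": 15},  # induced in both
    "gene_z": {"A1": 15, "A2": 3, "B1": 3, "B2": 3},   # repressed in A only
}
hits = differentially_regulated(expr)
```

In this sketch a gene that is induced equally in both cell types, such as `gene_y`, is not reported, since its regulation is generic rather than cell-specific; genes that are induced or repressed in only one cell type are retained.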

The mechanism of transcriptome comparison in the above example may be as follows. Subtracted cDNA
libraries are separately prepared for hypoxic macrophages and cardiomyoblasts; for both cell types, their
cDNA under normoxic conditions is subtracted against their cDNA under hypoxic conditions. This might
be effected by harvesting RNA from cells both in normoxia and hypoxia, and preparing cDNA.
Subtractive hybridization, optionally including suppression PCR, may then be performed to remove genes
from the hypoxic cell cDNA which are also present in cDNA from normoxic cells. Insert DNA from
these subtracted libraries can then be amplified and arrayed onto duplicate membranes. Quantitative
hybridization with pre-library cDNA material (normoxia and hypoxia) then allows the comparison of
differentially-expressed clones in the two cell types. The clones representing hypoxia-inducible genes
may then be identified, for example, by sequencing.

Other techniques that are suitable for the analysis of the transcriptome of a specific cell type include serial
analysis of gene expression (SAGE; Velculescu et al., Science (1995) 270: 484-487); selective
amplification via biotin- and restriction-mediated enrichment (SABRE) (Lavery et al. (1997) PNAS USA
94: 6831-6836); differential display (for example, indexing differential display reverse transcriptase
polymerase chain reaction (DDRT-PCR; Mahadeva et al. (1998) J. Mol. Biol. 284: 1391-1398));
representational difference analysis (RDA) (Hubank (1999) Methods in Enzymology 303: 325-349);
differential screening of cDNA libraries (see Sagerstrom et al. (1997) Annu. Rev. Biochem. 66: 751-783;
"Advanced Molecular Biology", R.M. Twyman (1998) Bios Scientific Publishers, Oxford; "Nucleic Acid
Hybridization", M. L. M. Anderson (1999) Bios Scientific Publishers, Oxford); Northern blotting; RNAse
protection assays; S1-nuclease protection assays; RT-PCR; real time RT-PCR (Taq-man); EST
sequencing; massively parallel signature sequencing (MPSS); and sequencing by hybridisation (SBH)
(see Drmanac R. et al. (1999) Methods in Enzymology 303: 165-178). Many of these techniques are
reviewed in "Comparative gene-expression analysis", Trends Biotechnol. 1999 Feb; 17(2): 73-8.

Methods such as these have been applied widely to study mechanisms of biological response. In
particular, microarrays have been used widely to compare gene expression levels between normal and
diseased tissue. More typically, however, comparisons are performed to detect changes in gene 
expression that are associated with specific aspects of disease progression or pathology. For instance, a 
study of prostate cancer would examine changes associated with the step-wise progression to full 
malignancy or the dependence on androgens for growth. 

20 Transcriptome analysis is complemented by the analysis of the complete protein make-up of a cell, 
referred to as proteomics, The use of two dimensional SDS-PAGE gels in combination with amino acid 
sequencing by mass spectrometry is currently the most widely-used technique in this field (see 
"Proteomics to study genes and genomes" Akhilesh Pandey and Matthias Mann, (2000), Nature 405: 837- 
846). Additionally, the recent developments in the field of protein and antibody arrays now allow the 

25 simultaneous detection of a large number of proteins. For example, low-density protein arrays on filter 
membranes, such as the universal protein array system (Ge H, (2000) Nucleic Acids Res. 28(2), e3) allow 
imaging of arrayed antigens using standard ELISA techniques and a scanning charge-coupled device 
(CCD) detector. Immuno-sensor arrays have also been developed that enable the simultaneous detection 
of clinical analytes. It is now possible, using protein arrays, to profile protein expression in bodily fluids,
such as in sera of healthy or diseased subjects, as well as in patients pre- and post-drug treatment.

Antibody arrays also facilitate the extensive parallel analysis of numerous proteins that are hypothetically
implicated in a disease or particular physiological state. A number of methods for the preparation of 
antibody arrays have recently been reported (see Cahill, Trends in Biotechnology, 2000 7:47-51).

It is not the intention here to review studies that have been conducted in this area previously. However,
one example of a physiological condition that has already received considerable attention is the response
to hypoxia. Several patent applications have now been published that involve an examination of the
genetic response to hypoxia (see WO00/12139, Quark Biotech, Inc.; WO00/12525, Quark Biotech, Inc.;
WO99/09049, Quark Biotech, Inc.; WO99/09046, Quark Biotech, Inc.; WO99/48916, The Board of
Trustees of the Leland Stanford Jr. University). These patent applications generally utilise methods of 
subtractive hybridisation and differential expression gene microarray analysis to examine this genetic 
response in certain cell lines. The studies have implicated specific genes as being either repressed or 
induced under hypoxic conditions as compared to their expression under normoxic conditions. These 
genes are taught as being useful generally in all cell types, being involved in the (generic) hypoxic
response. 

Significantly, the present invention extends this work, and, indeed, defines a significant advance over 
similar work that has been performed on the genetic mechanisms that act in response to other 
physiological or genetic stimuli. The present inventors, using the novel methods disclosed herein, have 
discovered that far from being generic, the cellular response to many physiological conditions differs
markedly between different cell types. The cellular response that has been studied in order to illustrate 
this finding is the response to hypoxia. From these results, it has been inferred herein, quite reasonably, 
that far from being generic, cellular response mechanisms differ widely, depending on cell type. 

This discovery has far-reaching implications as regards the design of therapeutic agents that are effective 
to counter a disease or physiological condition. For example, an agent that is effective to prevent the
drastic effects of hypoxia in a neurone (the effects of which include stroke) might be totally ineffective in 
countering the same effects in a cardiomyocyte (chronic ischemic heart disease). Through analysing the 
mechanism of the hypoxic response in different cell types, it may be, in contrast to the example given 
above, that a particular gene is involved in the hypoxic response in both cardiomyocytes and neurones. 
Were this to be the case, this would allow the design of a combined medicament, for example, a
combined cardioprotective and neuroprotective agent. There thus remains a great need for the 
identification of proteins implicated in the physiological mechanism of hypoxia. 

According to a further aspect of the invention, there are provided genes and proteins that are identified 
using a method according to any one of the above-described aspects of the invention. Certain proteins, 
whose sequences are identified herein as SEQ ID Nos: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29,
31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 63, 67, 69, 73, 75, 77, 85, 87, 89, 91, 93, 95, 99,
103, 113, 115, 119, 121, 129, 131, 133, 137, 139, 141, 145, 151, 153, 157, 159, 163, 169, 181, 187, 201,
205, 207 and 209, are functionally annotated for the first time. At present, all of these sequences are only
identified as "hypothetical proteins" in the public databases. Each and every one of these sequences forms
an embodiment of this aspect of the invention.

The invention also includes proteins whose amino acid sequences are encoded by a nucleic acid sequence 
recited in various cDNAs and ESTs deposited in the public databases, or encoded by a gene identified 
from such an EST. These cDNAs and ESTs are presented herein as SEQ ID Nos: 62, 64, 66, 68, 70, 72,
74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118,
120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 
162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 
204, 206, 208, 210, 212, 214 and 216. At present, all of these cDNA and EST sequences are functionally 
unannotated in the public databases. Each and every one of these sequences forms an embodiment of this
aspect of the invention. 

One embodiment of this aspect of the invention provides a substantially purified polypeptide, which
polypeptide: 

i) comprises the amino acid sequence as recited in any one of SEQ ID Nos: 1, 3, 5, 7, 9, 11, 13,
15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 63, 67,
69, 73, 75, 77, 85, 87, 89, 91, 93, 95, 99, 103, 113, 115, 119, 121, 129, 131, 133, 137, 139, 141,
145, 151, 153, 157, 159, 163, 169, 181, 187, 201, 205, 207 or 209;

ii) has an amino acid sequence encoded by a nucleic acid sequence recited in any one of SEQ
ID Nos: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48,
50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96,
98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134,
136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172,
174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210,
212, 214 and 216 or encoded by a gene identified from an EST recited in any one of these SEQ
ID Nos;

iii) is a fragment of a polypeptide according to i) or ii), provided that said fragment retains a 
biological activity possessed by the full length polypeptide of i) or ii), or has an antigenic 
determinant in common with the polypeptide of i) or ii); or 

iv) is a functional equivalent of a polypeptide of i), ii) or iii).

The polypeptide sequences recited in SEQ ID Nos: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31,
33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 63, 67, 69, 73, 75, 77, 85, 87, 89, 91, 93, 95, 99, 103,
113, 115, 119, 121, 129, 131, 133, 137, 139, 141, 145, 151, 153, 157, 159, 163, 169, 181, 187, 201, 205,
207 and 209 were, prior to the present disclosure, unannotated in the literature and public sequence
databases. Accordingly, until now, no biological function has been attributed to these polypeptide
sequences; each of these sequences is generally labelled in the databases as a "hypothetical protein". The
methods of the present invention, described above, have now elucidated a biological function for these
polypeptides, in that they have been found to be differentially regulated under physiological conditions of
hypoxia.

These discoveries allow the development of regulators, such as small drug molecules, that affect the 
activity of these polypeptides, so allowing diseases and physiological conditions that are caused by 
hypoxia, or in which hypoxia has been implicated, to be treated. These discoveries also allow the 
development of diagnostic agents that are suitable for the detection of hypoxia in biological tissues and, 
through the identification of mutations and polymorphisms (such as SNPs) within genes coding for the
proteins implicated herein, allow the assessment of an individual's risk of being susceptible to diseases
and physiological conditions in which hypoxia is implicated. 

The biological activity of polypeptides whose sequences are listed in SEQ ID Nos: 1, 3, 5, 7, 9, 11, 13,
15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 63, 67, 69, 73, 75,
77, 85, 87, 89, 91, 93, 95, 99, 103, 113, 115, 119, 121, 129, 131, 133, 137, 139, 141, 145, 151, 153, 157,
159, 163, 169, 181, 187, 201, 205, 207 and 209 has been found to be hypoxia-regulated. The expression
of some of these polypeptides has been found to be induced under conditions of hypoxia, whilst the
expression of other polypeptides has been found to be repressed. By "hypoxia-induced" is meant that the
polypeptide is expressed at a higher level when a cell is exposed to hypoxic conditions as compared to its
expression level under normoxic conditions. By "hypoxia-repressed" is meant that the polypeptide is
expressed at a lower level when a cell is exposed to hypoxic conditions as compared to its expression
level under normoxic conditions.
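
The two definitions above can be illustrated with a minimal sketch. The function name, the fold-change threshold and the "unchanged" label are assumptions introduced purely for illustration; the text itself defines only "higher" and "lower" expression relative to normoxia and specifies no numerical threshold.

```python
def classify_hypoxia_regulation(normoxic_level, hypoxic_level, min_fold_change=2.0):
    """Label a polypeptide from its expression level under two conditions.

    min_fold_change is a hypothetical cut-off, not a value taken from the text.
    """
    if normoxic_level <= 0 or hypoxic_level <= 0:
        raise ValueError("expression levels must be positive")
    ratio = hypoxic_level / normoxic_level
    if ratio >= min_fold_change:
        return "hypoxia-induced"      # higher under hypoxia than normoxia
    if ratio <= 1.0 / min_fold_change:
        return "hypoxia-repressed"    # lower under hypoxia than normoxia
    return "unchanged"


print(classify_hypoxia_regulation(1.0, 4.0))
print(classify_hypoxia_regulation(4.0, 1.0))
```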

The following polypeptides have been found to be hypoxia-induced: those polypeptides whose amino
acid sequence is recited in SEQ ID Nos: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 63, 67,
69, 73, 75, 77, 85, 87, 89, 91, 93, 95, 99, 103, 113, 115, 119, 121, 129, 131, 133, 137, 139 and 141; and
those polypeptides whose amino acid sequence is encoded by a nucleic acid sequence recited in SEQ ID
Nos: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84,
86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128,
130, 132, 134, 136, 138, 140, 142 and 144 or is encoded by a gene identified from an EST recited in any
one of these SEQ ID Nos.

The following polypeptides have been found to be hypoxia-repressed: those polypeptides whose amino 
acid sequence is recited in SEQ ID Nos.: 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 145, 151, 153, 
157, 159, 163, 169, 181, 187, 201, 205, 207 and 209; and those polypeptides whose amino acid sequence
is encoded by a nucleic acid sequence recited in SEQ ID Nos.: 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56,
58, 60, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182,
184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214 and 216, or encoded by a
gene identified from an EST recited in any one of these SEQ ID Nos. 

For the purposes of this document, the term "hypoxia" should be taken to mean an environment of
oxygen tension such that the oxygen content is between about 5% and 0.1% (v/v). In most cases, hypoxic
tissue will have an oxygen content that is less than or equal to about 2%. The term "normoxia" should be 
taken to mean conditions comprising a normal level of oxygen for the environment concerned. Normoxic 
tissue typically has an oxygen content above about 5%. 
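
As a minimal sketch, the oxygen ranges given above can be written as a simple classifier. The function and constant names are hypothetical, the word "about" in the text is treated as a hard boundary for simplicity, and the label returned for values below 0.1% is an assumption, since the text does not name that range.

```python
HYPOXIA_UPPER_PERCENT = 5.0   # "between about 5% and 0.1% (v/v)"
HYPOXIA_LOWER_PERCENT = 0.1


def classify_oxygen_content(percent_o2):
    """Classify an oxygen content (% v/v) using the ranges defined in the text."""
    if not 0.0 <= percent_o2 <= 100.0:
        raise ValueError("oxygen content must be a percentage between 0 and 100")
    if percent_o2 > HYPOXIA_UPPER_PERCENT:
        return "normoxia"
    if percent_o2 >= HYPOXIA_LOWER_PERCENT:
        return "hypoxia"
    # Below the stated hypoxic range; the label is an assumption of this sketch.
    return "below stated hypoxic range"


print(classify_oxygen_content(21.0))
print(classify_oxygen_content(2.0))
```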

The polypeptide sequences whose amino acid sequence is encoded by a nucleic acid sequence recited in
SEQ ID Nos: 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104,
106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146,
148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188,
190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214 and 216, or whose amino acid sequence
is encoded by a gene identified from an EST recited in any one of these SEQ ID Nos., were also, prior to
the present disclosure, unannotated in the literature and public sequence databases, meaning that until
now, no biological function has been attributed to these polypeptide sequences.

The sequences in this group fall into a number of different categories. The first of these are cDNA clones,
for which a protein sequence has not been predicted by the depositor. A second category is expressed
sequence tag (EST) sequences that are represented in the UniGene database
(http://www.ncbi.nlm.nih.gov/UniGene/), which contain modest or weak homology to known proteins
when translated. ESTs are single-pass sequence files of the 5' region of an organism's expressed genome
as accessed via a force cloned cDNA library. EST sequences tend to be short and as a general rule are
error-prone. UniGene (see http://www.ncbi.nlm.nih.gov/Web/Newsltr/aug96.html for review) is an
experimental system for automatically partitioning these EST sequences into a non-redundant set of
gene-oriented clusters. Each UniGene cluster contains sequences that represent a unique gene, as well as
related information such as the tissue types in which the gene has been expressed and map location. A
third category of hits identified by the methods described herein is EST sequences that are contained in
UniGene clusters, but which are not annotated and exhibit no homologies to proteins contained in the
public databases. The fourth and final category encompasses singleton EST sequence entries that are not
incorporated as entries in the UniGene database and that only appear as single entries in the public
databases.

The methods of the present invention, described above, have now elucidated a biological function for
polypeptides that are encoded by genes incorporating cDNA and EST sequences that fall into the four
categories set out above, in that these sequences have been found to be differentially regulated under
physiological conditions of hypoxia. Such polypeptides may have an amino acid sequence that is encoded
by a nucleic acid sequence recited in any one of SEQ ID Nos: 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82,
84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126,
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168,
170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210,
212, 214 and 216. However, the EST sequences in particular may not be part of the actual coding
sequence for a gene, often representing regulatory regions of the gene, or regions that are transcribed, but
not translated into polypeptide. Accordingly, this aspect of the invention also includes polypeptides that
are encoded by a gene identified from an EST recited in any one of SEQ ID Nos: 62, 64, 66, 68, 70, 72,
74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118,
120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160,
162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202,
204, 206, 208, 210, 212, 214 and 216.

Polypeptides of this aspect of the invention are intended to include fragments of polypeptides according
to i) or ii) as defined above, provided that the fragment retains a biological activity that is possessed by
the full length polypeptide of i) or ii), or has an antigenic determinant in common with the polypeptide of
i) or ii). As used herein, the term "fragment" refers to a polypeptide having an amino acid sequence that is
the same as part, but not all, of an amino acid sequence as recited in any one of SEQ ID Nos: 1, 3, 5, 7, 9,
11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 63, 67, 69,
73, 75, 77, 85, 87, 89, 91, 93, 95, 99, 103, 113, 115, 119, 121, 129, 131, 133, 137, 139, 141, 145, 151,
153, 157, 159, 163, 169, 181, 187, 201, 205, 207 and 209, an amino acid sequence that is encoded by a
nucleic acid sequence recited in any one of SEQ ID Nos. 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86,
88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128,
130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170,
172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212,
214 and 216, or an amino acid sequence that is encoded by a gene that is linked to a nucleic acid sequence
recited in any one of these SEQ ID Nos. The fragments should comprise at least n consecutive amino
acids from the sequence and, depending on the particular sequence, n preferably is 7 or more (for
example, 8, 10, 12, 14, 16, 18, 20 or more). Small fragments may form an antigenic determinant.
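
The sequence-level part of this definition (part, but not all, of the full-length sequence, with at least n consecutive residues) can be sketched as follows. The function name and the example sequence are hypothetical; the sketch checks sequence only and says nothing about whether a fragment retains biological activity or shares an antigenic determinant.

```python
def is_sequence_fragment(candidate, full_length, min_length=7):
    """Return True if candidate is a contiguous part, but not all, of full_length
    and is at least min_length residues long (n = 7 is the preferred minimum)."""
    long_enough = len(candidate) >= min_length
    not_whole = len(candidate) < len(full_length)
    return long_enough and not_whole and candidate in full_length


# Hypothetical one-letter amino acid sequence, used for illustration only.
example_full_length = "MKTAYIAKQRQISFVK"
print(is_sequence_fragment("AYIAKQR", example_full_length))
```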

Such fragments may be isolated fragments, that are not part of or fused to other amino acids or 
polypeptides, or they may be comprised within a larger polypeptide, of which they form a part or region.
When comprised within a larger polypeptide, a fragment of the invention most preferably forms a single
continuous region. For instance, certain preferred embodiments relate to a fragment having a pre- and/or
pro-polypeptide region fused to the amino terminus of the fragment and/or an additional region fused to
the carboxyl terminus of the fragment. However, several fragments may be comprised within a single
larger polypeptide.

The polypeptides of the present invention or their immunogenic fragments (comprising at least one 
antigenic determinant) can be used to generate ligands, such as polyclonal or monoclonal antibodies, that 
are immunospecific for the polypeptides. Such antibodies may be employed to isolate or to identify 
clones that express a polypeptide according to the invention or, for example, to purify the polypeptide by 
affinity chromatography. Such antibodies may also be employed as diagnostic or therapeutic aids,
amongst other applications, as will be apparent to the skilled reader. 

The term "immunospecific" means that an antibody has substantially greater affinity for a polypeptide
according to the invention than for related polypeptides. As used herein, the term "antibody"
is intended to include intact molecules as well as fragments thereof, such as Fab, F(ab')2 and scFv, which
are capable of binding to the antigenic determinant in question.

The invention also includes functional equivalents of a polypeptide of i), ii) or iii) as recited above. A
functionally-equivalent polypeptide according to this aspect of the invention may be a polypeptide that is
homologous to a polypeptide whose sequence is explicitly recited herein. Two polypeptides are said to be
"homologous" if the sequence of one of the polypeptides has a high enough degree of identity or
similarity to the sequence of the other polypeptide for the skilled person to determine that they are similar
in origin and function. Preferably, homology is used to refer to sequence identity. "Identity" indicates that
at any particular position in the aligned sequences, the amino acid residue is identical between the
sequences. "Similarity" indicates that, at any particular position in the aligned sequences, the amino acid
residue is of a similar type between the sequences. Degrees of identity and similarity can be readily
calculated according to methods known in the art (see, for example, Computational Molecular Biology,
Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics and Genome
Projects, Smith, D.W., ed., Academic Press, New York, 1993). Advantageously, the BLAST algorithm is
employed, with parameters set to default values. The BLAST algorithm is described in detail at
http://www.ncbi.nih.gov/BLAST/blast_help.html, which is incorporated herein by reference. The search
parameters are defined as follows, and are advantageously set to the defined default parameters.

Advantageously, "substantial homology" when assessed by BLAST equates to sequences which match 
with an EXPECT value of at least about 7, preferably at least about 9 and most preferably 10 or more. 
The default threshold for EXPECT in BLAST searching is usually 10.

BLAST (Basic Local Alignment Search Tool) is the heuristic search algorithm employed by the programs
blastp, blastn, blastx, tblastn, and tblastx; these programs ascribe significance to their findings using the
statistical methods of Karlin and Altschul (see http://www.ncbi.nih.gov/BLAST/blast_help.html) with a
few enhancements. The BLAST programs were tailored for sequence similarity searching, for example to
identify homologues to a query sequence. The programs are not generally useful for motif-style
searching. For a discussion of basic issues in similarity searching of sequence databases, see Altschul et
al. (1994) Nature Genetics 6:119-129.

The five BLAST programs available at http://www.ncbi.nlm.nih.gov perform the following tasks: 

blastp compares an amino acid query sequence against a protein sequence database; 

blastn compares a nucleotide query sequence against a nucleotide sequence database;

blastx compares the six-frame conceptual translation products of a nucleotide query sequence (both 
strands) against a protein sequence database; 

tblastn compares a protein query sequence against a nucleotide sequence database dynamically translated 
in all six reading frames (both strands);

tblastx compares the six-frame translations of a nucleotide query sequence against the six-frame
translations of a nucleotide sequence database. 
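
The mapping between query type, database type and program described in the list above can be summarised in a short sketch. The function name and its arguments are hypothetical helpers introduced for illustration; they are not part of any BLAST distribution.

```python
def choose_blast_program(query_type, database_type, translate_both=False):
    """Pick one of the five BLAST programs from the sequence types involved.

    query_type and database_type are "protein" or "nucleotide". translate_both
    only matters for nucleotide against nucleotide, where it selects tblastx.
    """
    table = {
        ("protein", "protein"): "blastp",
        ("nucleotide", "nucleotide"): "tblastx" if translate_both else "blastn",
        ("nucleotide", "protein"): "blastx",
        ("protein", "nucleotide"): "tblastn",
    }
    try:
        return table[(query_type, database_type)]
    except KeyError:
        raise ValueError("sequence types must be 'protein' or 'nucleotide'")


print(choose_blast_program("nucleotide", "protein"))
```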

BLAST uses the following search parameters: 

HISTOGRAM Display a histogram of scores for each search; default is yes. (See parameter H in the 
BLAST Manual). 

DESCRIPTIONS Restricts the number of short descriptions of matching sequences reported to the
number specified; default limit is 100 descriptions. (See parameter V in the manual page). See also 
EXPECT and CUTOFF. 

ALIGNMENTS Restricts database sequences to the number specified for which high-scoring segment 
pairs (HSPs) are reported; the default limit is 50. If more database sequences than this happen to satisfy 
the statistical significance threshold for reporting (see EXPECT and CUTOFF below), only the matches
ascribed the greatest statistical significance are reported. (See parameter B in the BLAST Manual). 

EXPECT The statistical significance threshold for reporting matches against database sequences; the
default value is 10, such that 10 matches are expected to be found merely by chance, according to the
stochastic model of Karlin and Altschul (1990). If the statistical significance ascribed to a match is greater
than the EXPECT threshold, the match will not be reported. Lower EXPECT thresholds are more
stringent, leading to fewer chance matches being reported. Fractional values are acceptable. (See
parameter E in the BLAST Manual).

CUTOFF Cutoff score for reporting high-scoring segment pairs. The default value is calculated from the
EXPECT value (see above). HSPs are reported for a database sequence only if the statistical significance
ascribed to them is at least as high as would be ascribed to a lone HSP having a score equal to the
CUTOFF value. Higher CUTOFF values are more stringent, leading to fewer chance matches being
reported. (See parameter S in the BLAST Manual). Typically, significance thresholds can be more
intuitively managed using EXPECT.
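
The link between EXPECT and CUTOFF can be illustrated with the Karlin-Altschul relation E = K·m·n·exp(-λS), where S is the score of a single HSP, m is the query length and n is the database length. The sketch below is illustrative only: the function names are hypothetical, the values of K and λ are placeholders (in practice they depend on the scoring system and are not given in this document), and the calculation is not claimed to reproduce the exact default computed by any BLAST release.

```python
import math


def expect_value(score, query_length, database_length, k=0.13, lam=0.32):
    """Expected number of chance HSPs scoring at least `score`.

    k and lam are placeholder Karlin-Altschul parameters, assumed for illustration.
    """
    return k * query_length * database_length * math.exp(-lam * score)


def cutoff_score(expect, query_length, database_length, k=0.13, lam=0.32):
    """Score whose expect value equals `expect` (the inverse of expect_value)."""
    return math.log(k * query_length * database_length / expect) / lam


# A lower EXPECT threshold corresponds to a higher (more stringent) cutoff score.
for e in (10.0, 1.0, 0.001):
    print(e, round(cutoff_score(e, 300, 1_000_000), 1))
```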

MATRIX Specify an alternate scoring matrix for BLASTP, BLASTX, TBLASTN and TBLASTX. The 
default matrix is BLOSUM62 (Henikoff & Henikoff, 1992). The valid alternative choices include:
PAM40, PAM120, PAM250 and IDENTITY. No alternate scoring matrices are available for BLASTN; 
specifying the MATRIX directive in BLASTN requests returns an error response. 

STRAND Restrict a TBLASTN search to just the top or bottom strand of the database sequences; or 
restrict a BLASTN, BLASTX or TBLASTX search to just reading frames on the top or bottom strand of 
the query sequence.

FILTER Mask off segments of the query sequence that have low compositional complexity, as 
determined by the SEG program of Wootton & Federhen (1993) Computers and Chemistry 17:149-163, 
or segments consisting of short-periodicity internal repeats, as determined by the XNU program of 
Claverie & States (1993) Computers and Chemistry 17:191-201, or, for BLASTN, by the DUST program 
of Tatusov and Lipman (see http://www.ncbi.nlm.nih.gov). Filtering can eliminate statistically significant
but biologically uninteresting reports from the blast output (e.g., hits against common acidic-, basic- or 
proline-rich regions), leaving the more biologically interesting regions of the query sequence available for 
specific matching against database sequences. 

Low complexity sequence found by a filter program is substituted using the letter "N" in nucleotide
sequence (e.g., "NNNNNNNNNNNNN") and the letter "X" in protein sequences (e.g.,
"XXXXXXXXX").

Filtering is only applied to the query sequence (or its translation products), not to database sequences. 
Default filtering is DUST for BLASTN, SEG for other programs. 

It is not unusual for nothing at all to be masked by SEG, XNU, or both, when applied to sequences in 
SWISS-PROT, so filtering should not be expected to always yield an effect. Furthermore, in some cases,
sequences are masked in their entirety, indicating that the statistical significance of any matches reported 
against the unfiltered query sequence should be suspect.

NCBI-gi Causes NCBI gi identifiers to be shown in the output, in addition to the accession and/or locus
name.

Most preferably, sequence comparisons are conducted using the simple BLAST search algorithm 
provided at http://www.ncbi.nlm.nih.gov/BLAST.

Alternatively, sequence homology may be determined by algorithms such as FastA, available at
http://biology.ncsa.uiuc.edu/BW30/BW.cgi. FastA is considered to be superior to BLAST for alignment
of short sequences. Advantageously, the FastA algorithm is employed using default parameters at
http://biology.ncsa.uiuc.edu/BW30/BW.cgi.

Typically, greater than 50% identity between two polypeptides is considered to be an indication of
functional equivalence, provided that either the biological activity of the polypeptide is retained or the
polypeptides possess an antigenic determinant in common. Preferably, a functionally equivalent
polypeptide according to this aspect of the invention exhibits a degree of sequence identity with a
polypeptide sequence explicitly identified herein, or with a fragment thereof, of greater than 50%. More
preferred polypeptides have degrees of identity of greater than 60%, 70%, 80%, 90%, 95%, 98% or 99%,
respectively.
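
A degree of identity of this kind can be computed from two sequences that have already been aligned. The sketch below is a simplified illustration: the function name is hypothetical, gaps are written as "-", and the denominator used (aligned positions without a gap) is one convention among several, so the result need not match the figure reported by BLAST or FastA for the same pair.

```python
def percent_identity(aligned_a, aligned_b):
    """Percentage of identical residues over gap-free aligned positions."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" or b == "-":
            continue  # skip positions where either sequence has a gap
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no gap-free positions to compare")
    return 100.0 * identical / compared


# Hypothetical aligned sequences: 6 of 8 positions are identical.
identity = percent_identity("MKTAYIAK", "MKTGYIAR")
print(identity, identity > 50.0)
```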

Functionally-equivalent polypeptides according to the invention are therefore intended to include natural 
biological variants (for example, allelic variants or geographical variations within the species from which 
the polypeptides are derived) and mutants (such as mutants containing amino acid substitutions, insertions 
or deletions) of the polypeptides whose sequences are explicitly recited herein. Such mutants may include 
polypeptides in which one or more of the amino acid residues are substituted with a conserved or
non-conserved amino acid residue (preferably a conserved amino acid residue) and such substituted amino
acid residue may or may not be one encoded by the genetic code. Typical such substitutions are among
Ala, Val, Leu and Ile; among Ser and Thr; among the acidic residues Asp and Glu; among Asn and Gln;
among the basic residues Lys and Arg; or among the aromatic residues Phe and Tyr.
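
The typical substitution groups listed above can be expressed as a small lookup, here using one-letter amino acid codes. The names are hypothetical, and the sketch covers only the six groups named in the text (the first group being Ala, Val, Leu and isoleucine); other groupings of similar residues are used elsewhere.

```python
# Groups as listed in the text: Ala/Val/Leu/Ile, Ser/Thr, Asp/Glu, Asn/Gln, Lys/Arg, Phe/Tyr.
CONSERVATIVE_GROUPS = [set("AVLI"), set("ST"), set("DE"), set("NQ"), set("KR"), set("FY")]


def is_typical_conservative_substitution(original, replacement):
    """True if both residues differ and fall within the same listed group."""
    if original == replacement:
        return False  # identical residues are not a substitution
    return any(original in group and replacement in group for group in CONSERVATIVE_GROUPS)


print(is_typical_conservative_substitution("L", "I"))
print(is_typical_conservative_substitution("K", "D"))
```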

Particularly preferred are variants in which several, i.e. between 5 and 10, 1 and 5, 1 and 3, 1 and 2 or just
1 amino acids are substituted, deleted or added in any combination. Especially preferred are silent
substitutions, additions and deletions, which do not alter the properties and activities of the protein. Also
especially preferred in this regard are conservative substitutions. "Mutant" polypeptides also include
polypeptides in which one or more of the amino acid residues include a substituent group. 

As discussed above, using a method according to the above-described aspects of the invention it has now
been discovered, most surprisingly, that the response to hypoxia differs between different specialised cell 
types or between different physiological states of the same cell type. For example, it has been found that 
in macrophage cells, different polypeptides are induced/repressed during different physiological states.
Furthermore, it has been found that a subset of this group of polypeptides are regulated only in activated
macrophage cells. Macrophages possess various biological activities, including cytotoxic effects towards 
tumour cells and phagocytosis of bacteria or cellular debris. These form an important and potent arm of 
innate immunity, and as such must be finely regulated. In the absence of interactions with pathogens or 
other immune cells, the aforementioned activities of the macrophage are greatly reduced (i.e. resting
macrophages). When given appropriate stimuli, such as contact with the lipopolysaccharide surface of 
bacteria, and/or exposure to T-cell derived interferon gamma, the functional activities of the macrophage 
are greatly potentiated (i.e. activated macrophage). 

The expression of a further subset of these polypeptides has been found herein to be induced in activated 
macrophages under conditions of hypoxia, whilst a still further subset has been found herein to be
repressed in activated macrophages under conditions of hypoxia. 

In resting macrophage cells, it has been found that different polypeptides are induced/repressed during the 
biological response to hypoxia. For example, it has been found that a subset of this group of polypeptides 
are regulated only in resting macrophage cells. The expression of a further subset of these polypeptides 
has been found herein to be induced in resting macrophages under conditions of hypoxia, whilst a still
further subset has been found herein to be repressed in resting macrophages under conditions of hypoxia. 
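
The subsets described above (regulated only in activated macrophages, only in resting macrophages, or in both) amount to simple set comparisons. In the sketch below the function name and the gene identifiers are placeholders invented for illustration; they do not correspond to any sequence recited herein.

```python
def partition_by_macrophage_state(regulated_in_activated, regulated_in_resting):
    """Split hypoxia-regulated genes by the macrophage state in which they respond."""
    activated = set(regulated_in_activated)
    resting = set(regulated_in_resting)
    return {
        "activated_only": activated - resting,
        "resting_only": resting - activated,
        "both_states": activated & resting,
    }


# Placeholder identifiers, for illustration only.
subsets = partition_by_macrophage_state(
    {"gene_A", "gene_B", "gene_C"},
    {"gene_B", "gene_D"},
)
print(sorted(subsets["activated_only"]))
```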

According to a further aspect of the invention, there is provided a purified and isolated nucleic acid 
molecule that encodes a polypeptide according to any one of the aspects of the invention discussed above. 
Such a nucleic acid molecule may consist of the nucleic acid sequence as recited in any one of SEQ ID
Nos. 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56,
58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106,
108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148,
150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190,
192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214 and 216, or form a redundant equivalent or
fragment thereof. This aspect of the invention also includes a purified nucleic acid molecule which
hybridizes under high stringency conditions with a nucleic acid molecule as described above.

According to a further aspect of the invention, there is provided an expression vector that contains a 
purified and isolated nucleic acid molecule according to the aspects of the invention described above. The 
invention also incorporates a delivery vehicle, such as a liposome, comprising a nucleic acid according to 
the above-described aspects of the invention.

In a further aspect, the invention provides a host cell transformed with a vector of the above-described 
aspect of the invention.

In a still further aspect, the invention provides a ligand that binds specifically to a polypeptide according
to the above-described aspects of the invention. The ligand may be an antagonist ligand that inhibits the 
biological activity of the polypeptide, or may be an agonist ligand that activates the hypoxia-induced 
activity of the polypeptide to augment or potentiate a hypoxia-induced activity. 

In a still further aspect of the invention, there is provided a ligand which binds specifically to, and which
preferably inhibits the hypoxia-induced activity of, a polypeptide according to any one of the
above-described aspects of the invention. Such a ligand may, for example, be an antibody that is immunospecific
for the polypeptide in question. 

According to a further aspect, the invention provides a polypeptide, a nucleic acid molecule, vector or 
ligand as described above, for use in therapy or diagnosis of a disease or abnormal physiological
condition. Preferably, the disease or abnormal physiological condition is one that is affected by hypoxia;
examples of such diseases include cancer, ischaemic conditions (such as stroke, coronary arterial disease,
peripheral arterial disease), reperfusion injury, retinopathy, neonatal stress, pre-eclampsia, atherosclerosis,
inflammatory conditions (including rheumatoid arthritis), hair loss and wound healing. The undesired
cellular processes involved in said diseases might include, but are not restricted to, tumourigenesis,
angiogenesis, apoptosis, inflammation or erythropoiesis. The undesired biochemical processes involved in
said cellular processes might include, but are not restricted to, glycolysis, gluconeogenesis, glucose
transportation, catecholamine synthesis, iron transport or nitric oxide synthesis. 

According to the invention, a number of known proteins have also been implicated in the biological 
response to hypoxia. The functions of these proteins are known, meaning that these functions have been
annotated in the public databases. The sequences of these proteins are presented in SEQ ID Nos.: 217, 
219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 
261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 
303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343,
345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385,
387, 389, 391, 393, 395, 397, 399, 401, 403, 405, 407, 409, 411, 413, 415, 417, 419, 421, 423, 425, 427, 
429, 431, 433, 435, 437, 439, 441, 443, 445, 447, 449, 451, 453, 455, 457, 459, 461, 463, 465, 467, 469, 
471, 473, 475, 477, 479, 481, 483, 485 and 487. 

According to a further aspect of the invention, there is provided a substantially purified polypeptide, 
which polypeptide:

i) comprises the amino acid sequence as recited in any one of SEQ ID Nos: 1, 3, 5, 7, 9, 
11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55,
57, 59, 63, 67, 69, 73, 75, 77, 85, 87, 89, 91, 93, 95, 99, 103, 113, 115, 119, 121, 129,




131, 133, 137, 139, 141, 145, 151, 153, 157, 159, 163, 169, 181, 187, 201, 205, 207 or 
209 or any one of SEQ ID Nos.: 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 
239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 
273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 
307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339,
341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 
375, 377, 379, 381, 383, 385, 387, 389, 391, 393, 395, 397, 399, 401, 403, 405, 407, 
409, 411, 413, 415, 417, 419, 421, 423, 425, 427, 429, 431, 433, 435, 437, 439, 441, 
443, 445, 447, 449, 451, 453, 455, 457, 459, 461, 463, 465, 467, 469, 471, 473, 475,
477, 479, 481, 483, 485, 487, 489 and 491;

ii) has an amino acid sequence encoded by a nucleic acid sequence recited in any one of 
SEQ ID Nos: 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 
98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132,
134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166,
168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200,
202, 204, 206, 208, 210, 212, 214 and 216, or encoded by a gene identified from an EST
recited in any one of these SEQ ID Nos; 

iii) is a fragment of a polypeptide according to i) or ii), provided that said fragment retains a 
biological activity possessed by the full length polypeptide of i) or ii), or has an
antigenic determinant in common with the polypeptide of i) or ii); or

iv) is a functional equivalent of a polypeptide of i), ii) or iii);

for use in the diagnosis or therapy of tumourigenesis, angiogenesis, apoptosis, the biological response to 
hypoxia conditions, or a hypoxic-associated pathology. 

The invention also provides a purified and isolated nucleic acid molecule that encodes a polypeptide 
according to this aspect of the invention, for use in the diagnosis or therapy of tumourigenesis,
angiogenesis, apoptosis, the biological response to hypoxia conditions, or a hypoxic-associated pathology. 
The sequences of these molecules are provided in SEQ ID Nos.: 218, 220, 222, 224, 226, 228, 230, 232, 
234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 
276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 
318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358,
360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392, 394, 396, 398, 400, 
402, 404, 406, 408, 410, 412, 414, 416, 418, 420, 422, 424, 426, 428, 430, 432, 434, 436, 438, 440, 442, 
444, 446, 448, 450, 452, 454, 456, 458, 460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484,
486 and 488. As described above for the EST nucleic acid sequences annotated herein, this aspect of the
invention includes redundant equivalents and fragments of the sequences explicitly recited in SEQ ID 
Nos.: 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 
258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 
300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340,
342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 
384, 386, 388, 390, 392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418, 420, 422, 424, 
426, 428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452, 454, 456, 458, 460, 462, 464, 466, 
468, 470, 472, 474, 476, 478, 480, 482, 484, 486 and 488, and purified nucleic acid molecules which 
hybridize under high stringency conditions with such nucleic acid molecules, and vectors containing such
nucleic acid molecules for use in the diagnosis or therapy of tumourigenesis, angiogenesis, apoptosis, the
biological response to hypoxia conditions, or a hypoxic-associated pathology.

This aspect of the invention also includes ligands which bind specifically to, and which preferably inhibit 
the hypoxia-induced activity of, a polypeptide listed in SEQ ID Nos.: 217, 219, 221, 223, 225, 227, 229,
231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271,
273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313,
315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355,
357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, 387, 389, 391, 393, 395, 397,
399, 401, 403, 405, 407, 409, 411, 413, 415, 417, 419, 421, 423, 425, 427, 429, 431, 433, 435, 437, 439,
441, 443, 445, 447, 449, 451, 453, 455, 457, 459, 461, 463, 465, 467, 469, 471, 473, 475, 477, 479, 481,
483, 485 and 487, for use in the diagnosis or therapy of tumourigenesis, angiogenesis, apoptosis, the
biological response to hypoxia conditions, or a hypoxic-associated pathology.

The invention also provides a pharmaceutical composition suitable for modulating hypoxia and/or 
ischaemia, comprising a therapeutically-effective amount of a polypeptide, a nucleic acid molecule,
vector or ligand as described above, in conjunction with a pharmaceutically-acceptable carrier.

The invention also provides a vaccine composition comprising a polypeptide, or a nucleic acid molecule 
as described above. 

The invention also provides a method of treating a disease in a patient in need of such treatment by 
administering to a patient a therapeutically effective amount of a polypeptide, a nucleic acid molecule, 
vector, ligand or pharmaceutical composition as described above. For diseases in which the expression of
the natural gene or the activity of the polypeptide is lower in a diseased patient when compared to the 
level of expression or activity in a healthy patient, the polypeptide, nucleic acid molecule, ligand, 
compound or composition administered to the patient should be an agonist. For diseases in which the
expression of the natural gene or activity of the polypeptide is higher in a diseased patient when compared
to the level of expression or activity in a healthy patient, the polypeptide, nucleic acid molecule, vector,
ligand, compound or composition administered to the patient is an antagonist. By the term "agonist" is
meant herein, any polypeptide, peptide, synthetic molecule or organic molecule that functions as an
activator, by increasing the effective biological activity of a polypeptide, for example, by increasing gene
expression or enzymatic activity. By the term "antagonist" is meant herein, any polypeptide, peptide, 
synthetic molecule or organic molecule that functions as an inhibitor, by decreasing the effective 
biological activity of the gene product, for example, by inhibiting gene expression of an enzyme or a 
pharmacological receptor. 

The invention also provides for the use of a polypeptide, nucleic acid molecule, vector, ligand or
pharmaceutical composition according to any one of the above-described aspects of the invention in 
modifying the response of a cell to conditions of hypoxia. 

The invention also provides a polypeptide, nucleic acid molecule, vector, ligand or pharmaceutical 
composition according to any one of the above-described aspects of the invention, for use in the 
manufacture of a medicament for the treatment of a hypoxia-regulated condition.

The invention also provides a method of monitoring the therapeutic treatment of disease or physiological 
condition in a patient, comprising monitoring over a period of time the level of expression or activity of 
a polypeptide, nucleic acid molecule, vector or ligand in tissue from said patient, wherein altering said level
of expression or activity over the period of time towards a control level is indicative of regression of said
disease or physiological condition.
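
By way of illustration only, the monitoring criterion described above (a level of expression or activity that moves towards a control level over a period of time) could be checked computationally along the following lines. This is a minimal sketch and not part of the method as described: the function name `moves_towards_control`, the use of absolute distance from the control level, and the requirement that no measurement moves further from the control than its predecessor are all assumptions made for the example.

```python
from typing import Sequence


def moves_towards_control(levels: Sequence[float], control: float) -> bool:
    """Return True if successive measurements approach the control level.

    Assumed criterion (hypothetical, for illustration): the absolute
    distance from the control level never increases between consecutive
    measurements, and the last measurement is strictly closer to the
    control level than the first.
    """
    if len(levels) < 2:
        raise ValueError("at least two measurements are needed")
    distances = [abs(level - control) for level in levels]
    never_further = all(b <= a for a, b in zip(distances, distances[1:]))
    return never_further and distances[-1] < distances[0]
```

For example, `moves_towards_control([8.0, 5.0, 2.5], control=2.0)` evaluates the distances 6.0, 3.0 and 0.5 and returns `True`. In practice the comparison would need to allow for measurement noise, which this sketch ignores.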

The invention also provides a method of providing a hypoxia regulating gene, an apoptotic or an 
angiogenesis regulating gene by administering directly to a patient in need of such therapy an expressible 
vector comprising expression control sequences operably linked to one or more of the nucleic acid 
molecules as described above. 

The invention also provides a method of diagnosing a hypoxia-regulated condition in a patient,
comprising assessing the level of expression of a natural gene encoding a polypeptide according to any 
one of the aspects of the invention described above in tissue from said patient and comparing said level of 
expression or activity to a control level, wherein a level that is different to said control level is indicative 
of the hypoxia-related condition.

Such a method of diagnosis may be carried out in vitro. One example of a suitable method comprises the
steps of: (a) contacting a ligand as described above with a biological sample under conditions suitable for 
the formation of a ligand-polypeptide complex; and (b) detecting said complex.

A further example of a suitable method may comprise the steps of: a) contacting a sample of tissue from
the patient with a nucleic acid probe under stringent conditions that allow the formation of a hybrid 
complex between a nucleic acid molecule whose sequence is recited in any one of SEQ ID Nos.: 2, 4, 6, 
8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 
64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 110,
112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 
154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 
196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 
238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278,
280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320,
322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 
364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392, 394, 396, 398, 400, 402, 404, 
406, 408, 410, 412, 414, 416, 418, 420, 422, 424, 426, 428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 
448, 450, 452, 454, 456, 458, 460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486 and
488, and the probe; b) contacting a control sample with said probe under the same conditions used in step
a); and c) detecting the presence of hybrid complexes in said samples; wherein detection of levels of the 
hybrid complex in the patient sample that differ from levels of the hybrid complex in the control sample 
is indicative of the hypoxia-related condition. 

A still further example of a suitable method may comprise the steps of: a) contacting a sample of nucleic 
acid from tissue of the patient with a nucleic acid primer under stringent conditions that allow the
formation of a hybrid complex between a nucleic acid molecule whose sequence is recited in any one of 
SEQ ID Nos.: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 
52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 
104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 
146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186,
188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 
230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 
272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 
314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 
356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392, 394, 396,
398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418, 420, 422, 424, 426, 428, 430, 432, 434, 436, 438, 
440, 442, 444, 446, 448, 450, 452, 454, 456, 458, 460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 
482, 484, 486 and 488, and the primer; b) contacting a control sample with said primer under the same 
conditions used in step a); c) amplifying the sampled nucleic acid; and d) detecting the level of
amplified nucleic acid from both patient and control samples; wherein detection of levels of the amplified
nucleic acid in the patient sample that differ significantly from levels of the amplified nucleic acid in the 
control sample is indicative of the hypoxia-related condition. 

A still further example of a suitable method may comprise the steps of: a) obtaining a tissue sample from
a patient being tested for the hypoxia-related condition; b) isolating a nucleic acid molecule according to
any one of the above-described aspects of the invention from said tissue sample; and c) diagnosing the 
patient for the hypoxia-related condition by detecting the presence of a mutation which is associated with 
the hypoxia-related condition in the nucleic acid molecule as an indication of the hypoxia-related 
condition. This method may comprise the additional step of amplifying the nucleic acid molecule to form 
an amplified product and detecting the presence or absence of a mutation in the amplified product.

Particular hypoxia-related conditions that may be diagnosed in this fashion include cancer, ischaemia, 
reperfusion, retinopathy, neonatal stress, pre-eclampsia, atherosclerosis, rheumatoid arthritis, undesired
hair loss, cardiac arrest or stroke, for example, caused by a disorder of the cerebral, coronary or peripheral
circulation.

In a further aspect, the invention provides a method for the identification of a compound that is effective
in the treatment and/or diagnosis of a hypoxia-regulated condition, comprising contacting a polypeptide, 
nucleic acid molecule, or ligand according to any one of the above-described aspects of the invention with 
one or more compounds suspected of possessing binding affinity for said polypeptide, nucleic acid 
molecule or ligand, and selecting a compound that binds specifically to said nucleic acid molecule,
polypeptide or ligand.

According to a still further aspect of the invention, there is provided a kit useful for diagnosing a
hypoxia-regulated condition, comprising a first container containing a nucleic acid probe that hybridises under
stringent conditions with a nucleic acid molecule according to any one of the aspects of the invention
described above; a second container containing primers useful for amplifying said nucleic acid molecule;
and instructions for using the probe and primers for facilitating the diagnosis of the hypoxia-regulated
condition. The kit may additionally comprise a third container holding an agent for digesting
unhybridised RNA.

To facilitate the diagnosis of the hypoxia-regulated condition using one of the methods outlined above,
in a further aspect, the invention provides an array of at least two nucleic acid molecules, wherein each of
said nucleic acid molecules either corresponds to the sequence of, is complementary to the sequence of,
or hybridises specifically to a nucleic acid molecule according to any one of the aspects of the invention 
described above. Such an array may contain nucleic acid molecules that either correspond to the sequence 
of, are complementary to the sequence of, or hybridise specifically to at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,
12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39,
40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 
68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 92a, 93, 94, 
95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116,
117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137,
138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 
159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 
180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 
201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220,
221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241,
242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 
263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 
284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295 or more of the nucleic acid molecules 
implicated in a hypoxia-regulated condition as recited above. The nucleic acid molecules on the array
may consist of oligonucleotides of between twelve and fifty nucleotides, more preferably, between forty
and fifty nucleotides. Alternatively, the nucleic acid molecules on the array may consist of PCR-amplified
cDNA inserts where the nucleic acid molecule is between 300 and 2000 nucleotides.
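
Purely as an illustration of the oligonucleotide length range given above, the following sketch derives candidate array oligonucleotides that are complementary to a target nucleic acid sequence. It is not the procedure used herein: the function names `reverse_complement` and `tile_probes`, the default length of 45 nucleotides, the non-overlapping tiling and the restriction to unambiguous A, C, G and T bases are all assumptions made for the example.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def tile_probes(target: str, length: int = 45, step: int = 45) -> list[str]:
    """Cut a target into windows and return the complementary oligonucleotides.

    `length` is restricted to the twelve-to-fifty nucleotide range stated
    in the text; `step` (hypothetical) sets the spacing between windows.
    """
    if not 12 <= length <= 50:
        raise ValueError("oligonucleotide length must be between 12 and 50")
    if step < 1:
        raise ValueError("step must be a positive integer")
    target = target.upper()
    return [
        reverse_complement(target[start:start + length])
        for start in range(0, len(target) - length + 1, step)
    ]
```

Each returned oligonucleotide is complementary to one window of the target, so it would be expected to hybridise to the corresponding region; real probe design would additionally screen for melting temperature, secondary structure and cross-hybridisation, none of which is attempted here.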

In a related aspect, again useful for diagnosis, the invention provides an array of antibodies, comprising at 
least two different antibody species, wherein each antibody species is immunospecific with a polypeptide 
implicated in a hypoxia-regulated condition as described above. The invention also provides an array of
polypeptides, comprising at least two polypeptide species as recited above, wherein each polypeptide 
species is implicated in a hypoxia-regulated condition, or is a functional equivalent variant or fragment 
thereof. 

Kits useful in the diagnostic methods of the invention may comprise such nucleic acid, antibody and/or 
polypeptide arrays.

According to the invention, a kit may also comprise one or more antibodies that bind to a polypeptide as 
recited above, and a reagent useful for the detection of a binding reaction between said antibody and said 
polypeptide. 

According to a still further aspect of the invention, there is provided a genetically-modified non-human 
animal that has been transformed to express higher, lower or absent levels of a polypeptide according to
any one of the aspects of the invention described above. Preferably, said genetically-modified animal is a 
transgenic or knockout animal.

The invention also provides a method for screening for a compound effective to treat a hypoxia-regulated
condition, by contacting a non-human genetically-modified animal as described above with a candidate
compound and determining the effect of the compound on the physiological state of the animal.

As discussed in some detail above, ischaemic disease pathologies involve a decrease in the blood supply 
to a bodily organ, tissue or body part generally caused by constriction or obstruction of the blood vessels.
One particular example of an ischaemic disease pathology is myocardial ischaemia, which encompasses 
several chronic and acute cardiac pathologies that involve the deprivation of the myocardium of its blood 
supply, usually through coronary artery occlusion. A key component of ischaemia is hypoxia. Following 
transient ischaemia, the affected tissue may be subjected to reperfusion and re-oxygenation, and this is of 
significance in its own right.

Ischaemia/reperfusion is well known to induce cell death in myocardial tissue by apoptosis, leading to 
impaired function of the myocardium and infarction. Many of the specific molecules required to execute 
the process of apoptosis are known, but not all of these molecules have been characterised in detail. Cell 
death may also proceed by a distinct process called necrosis, which, unlike apoptosis, is not initiated and
controlled by specific and dedicated cellular and biochemical mechanisms (see Nicotera et al, Biochem
Soc Symp. 1999; 66:69-73). There is substantial evidence that apoptotic cell death occurs either during or
after myocardial ischaemia (Kajstura et al, Lab Invest. 1996; 74(1):86-107; Cheng et al, Exp Cell Res.
1996; 226(2):316-27; Fliss and Gattinger, Circ Res. 1996; 79(5):949-56; Veinot et al, Hum Pathol. 1997;
28(4):485-92; Bialik et al, J Clin Invest. 1997; 100(6):1363-72; Gottlieb et al, J Clin Invest. 1994;
94(4):1621-8; Gottlieb and Engler, Ann N Y Acad Sci. 1999; 874:412-26). In the laboratory, apoptosis is
also induced by subjecting cardiac myocytes to hypoxia (Tanaka et al, Circ Res. 1994 Sep;75(3):426-33;
Long et al, J Clin Invest. 1997 99(11):2635-43).

Clearly, there would be a significant clinical application were a successful method to inhibit apoptosis in
ischaemic myocardial tissue to be devised. A specific and effective treatment requires identifying
biochemical target(s) that are responsible for mediating apoptosis specifically in ischaemic myocardial
cells. One target which plays a common role in mediating apoptosis in many cell types, namely p53, is
not involved in apoptosis resulting from myocardial ischaemia (Bialik et al, J Clin Invest. 1997;
100(6):1363-72). Others have shown that inhibiting key mediators of apoptosis, caspases, provides
protection against lethal reperfusion injury following myocardial ischaemia in rat models (Mocanu et al,
Br J Pharmacol. 2000; 130(2):197-200; Yaoita et al, Circulation. 1998 97(3):276-81; Holly et al, J Mol
Cell Cardiol. 1999 31(9):1709-15). However, this approach lacks specificity, since the caspases play a
key role in mediating apoptosis in the majority of mammalian cell types, where it is usually beneficial. An
approach that involves modulating the activity of molecules shown specifically to mediate apoptosis in
ischaemic cardiac cells would present a distinct advantage in both specificity and efficacy.

It has now been discovered that a polypeptide encoded by a gene identified from the EST recited in SEQ 
ID No 86, having the Protein accession number BAB15101 (encoded by Homo sapiens cDNA: FLJ21620
fis, clone COL07838 Nucleotide accession AK025273) is regulated by hypoxia. Other public domain
sequences corresponding to this gene include Homo sapiens cDNA: FLJ23265 fis, clone COL06456
Nucleotide accession AK026918. Accordingly, when referring in the present specification to the EST
recited in SEQ ID No 86, it is intended that these gene and protein sequences are also embraced. This
gene was identified using Research Genetics Human GeneFilters arrays, which contain an EST
corresponding to the gene (accession number R00332). In the art, the gene is now termed EGL nine
(C. elegans) homolog 3.

There are no reports that describe the function of this human gene. However, a high degree of amino acid 
homology is observed between the protein encoded by this gene, and a rat protein called "Growth factor 
responsive smooth muscle protein" or "SM20" (Nucleotide accession U06713; Protein accession 
A53770). An alignment of single letter amino acid sequences is shown below. Over the highlighted
region there is 97% amino acid similarity and 96% amino acid identity.

The high degree of amino acid similarity suggests that the human protein BAB15101 has an equivalent
biochemical function to the rat protein A53770 ("Growth factor responsive smooth muscle protein" or
"SM20"). Recent publications have shown that SM20 functions to promote apoptosis in neurons
(Lipscomb et al, J Neurochem 1999; 73(1):429-32; Lipscomb et al, J Biol Chem 2000 Nov 1; [epub
ahead of print]). Significantly, SM20 has been shown to be expressed at high levels in the heart (Wax et
al, J Biol Chem 1994; 269(17):13041-7).
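
For readers wishing to reproduce figures of the kind quoted above for the BAB15101 and A53770 alignment, percentage identity and percentage similarity over an aligned region can be computed as sketched below. This is an illustrative calculation only and is not stated to be the method used herein: the grouping of amino acids treated as "similar" (`SIMILAR_GROUPS`), the exclusion of gapped columns from the denominator, and the function name `identity_and_similarity` are all assumptions, and different alignment programs define similarity differently.

```python
# Hypothetical conservative-substitution groups; real tools typically
# derive similarity from a scoring matrix such as BLOSUM62 instead.
SIMILAR_GROUPS = ("ILVM", "FYW", "KRH", "DE", "ST", "NQ", "AG")


def _similar(a: str, b: str) -> bool:
    """True if two residues are identical or fall in the same assumed group."""
    return a == b or any(a in group and b in group for group in SIMILAR_GROUPS)


def identity_and_similarity(row_a: str, row_b: str) -> tuple[float, float]:
    """Return (% identity, % similarity) for two rows of an alignment.

    Both rows must have the same length, with '-' marking gaps. Columns
    containing a gap in either row are ignored (an assumption).
    """
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    pairs = [
        (a, b)
        for a, b in zip(row_a.upper(), row_b.upper())
        if a != "-" and b != "-"
    ]
    if not pairs:
        raise ValueError("no ungapped columns to compare")
    identical = sum(a == b for a, b in pairs)
    similar = sum(_similar(a, b) for a, b in pairs)
    return 100.0 * identical / len(pairs), 100.0 * similar / len(pairs)
```

Because similarity counts identical residues as well as conservative substitutions, the similarity figure is always at least as large as the identity figure, which agrees with the ordering of the 97% and 96% values quoted above.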

It has also been discovered that a polypeptide encoded by a gene identified from the EST recited in SEQ 
ID No 90, having the Protein accession number CAB81622, is regulated by hypoxia. The encoding 
human gene has been annotated in the UniGene database as "Similar to rat smooth muscle protein
SM-20"; the nucleotide sequence is contained within the nucleotide accession AL117352. More recently, a
longer fragment of this gene has been cloned, named C1orf12, or EGLN1 (Nucleotide accession
AAG34568; Protein accession AAG34568). Accordingly, when referring in the present specification to 
the EST recited in SEQ ID No 90, it is intended that these gene and protein sequences are also embraced. 

This distinct human gene, encoding a protein related to SM20 and EGLN3 (BAB15101), is also induced
in response to hypoxia. This gene was identified using Research Genetics Human GeneFilters arrays,
which contain an EST corresponding to the gene (accession number H56028). 

Independently of this, a fragment of this gene has been cloned from a cDNA library derived from hypoxic
human cardiomyoblasts, and it has been shown that the gene is increased in expression in response to
hypoxia in this cell type (see Table 1 herein; penultimate row). The nucleotide sequence of this cDNA
fragment is referred to herein as SEQ ID No 90a.

In the light of this novel discovery reported herein that these human equivalents of SM20 are induced by 
hypoxia, it is herein proposed that in cardiac ischaemia, the resulting apoptosis is due, at least in part, to
increased expression of these genes. 

The therapeutic modulation of the activity of EGLN3 (BAB15101), C1orf12 (AAG34568), CAB81622,
SM20 and other equivalent proteins and encoding genes therefore provides a novel means for the
treatment of myocardial ischaemia, through the alteration of the propensity of myocardial cells to undergo 
apoptosis. For example, a suitable treatment may involve altering the susceptibility of ischaemic 
myocardial tissue to subsequent reperfusion and re-oxygenation, or may involve modulating the 
susceptibility of chronic ischaemic myocardial tissue (including forms of angina) to later more severe 
ischaemia, which would result in myocardial infarction. It is submitted that, by way of analogy, cerebral
ischaemia may be treated using the same principle. 

These data provide the first connection between these related genes and the physiological response to 
hypoxia. Recently published research papers have identified that the protein products of these genes can
act as proline hydroxylases (see Bruick RK et al, Science. 2001; 294:1337-40 and Epstein AC et al, Cell.
2001; 107:43-54). This is consistent with our observations that certain proline hydroxylases are induced in
response to hypoxia and the genes EGLN1 and EGLN3 are part of the hypoxia response. For example,
two genes encoding proline hydroxylases have been identified herein as being increased in expression in
response to hypoxia (proline 4-hydroxylase, alpha polypeptide I; SEQ ID Nos: 231/232, proline 4-hydroxylase,
alpha polypeptide II; SEQ ID Nos: 349/350). This identifies a functional significance of proline hydroxylation
as a response to hypoxia. A preferred embodiment of the invention thus includes methods for modulating
the biological response to hypoxia by modulating the proline hydroxylase activity of the EGLN3
(BAB15101), C1orf12 (AAG34568), CAB81622 and SM20 proteins.

Furthermore, a number of bacteria, such as Moraxella, are thought to be involved in the initiation of
inflammatory diseases. Many bacteria contain, within their genome, genes encoding proteins that share
homology to the EGLN family of prolyl hydroxylases. We therefore propose that these bacterial genes
may initiate a hypoxic-like response at the site of infection, thereby causing localised inflammation. The
resulting inflammatory infiltrate could then cause the tissue to become hypoxic, thereby continuing the
cycle of hypoxia response.

As discussed in detail above, fragments and functional equivalents of the EGLN3 (BAB15101), C1orf12
(AAG34568), CAB81622, SM20 and other equivalent proteins are included within the present invention,
in addition to ligands that bind specifically to these proteins. Furthermore, the invention also embraces
purified and isolated nucleic acid molecules encoding these proteins, fragments and functional
equivalents, vectors containing such nucleic acid molecules and host cells transformed with these vectors.

The therapeutic and diagnostic applications discussed above are also equally relevant to this aspect of the 
invention. For example, small molecule inhibitors of the EGLN3 (BAB15101), C1orf12 (AAG34568),
CAB81622, SM20 and equivalent proteins and encoding genes are envisaged for utility as pharmaceutical
agents, particularly in modulating the proline hydroxylase activity of the EGLN3 and C1orf12 proteins.

Truncated or chimeric inhibitory derivatives of the encoding genes, or distinct genes that encode
regulators of the BAB15101, AAG34568, CAB81622 and SM20 encoding genes, are also envisaged for
utility for gene therapy. 

An alignment of the amino acid sequences of rat SM20 (Accession A53770), its human equivalent 
(Accession BAB15101; SEQ ID No: 85) and this distinct human homologue (Accession CAB81622 or
AAG34568; SEQ ID No: 89) is shown below:

               1                                               50
BAB15101   (1)
A53770     (1)
AAG34568   (1) MANDSGGPGGPSPSERDRQYCELCGKMENLLRCSRCRSSFYCCKEHQRQD

From this sequence alignment, a highly conserved region of amino acid sequence may be noted, the
consensus of which is as follows:

KAMVACYPGNGTGYVRHVDNPNGDGRCITCIYYLNKNWDAKLHGGILRIFPEGKSFIADVEPIFDRLLFF
WSDRRNPHEVQPSYATRYAMTVWYFDAEERAEAKKK

This consensus sequence, and variants thereof, may be used in the identification of other proteins that are
implicated in the biological response to hypoxia. This aspect of the invention therefore provides a
substantially purified polypeptide comprising the consensus sequence:

KAMVACYPGNGTGYVRHVDNPNGDGRCITCIYYLNKNWDAKLHGGILRIFPEGKSFIADVEPIFDRLLFF
WSDRRNPHEVQPSYATRYAMTVWYFDAEERAEAKKK, or a variant thereof.

The invention also provides a substantially purified polypeptide comprising the consensus sequence:

KAMVACYPGNGTGYVRHVDNPNGDGRCITCIYYLNKNWDAKLHGGILRIFPEGKSFIADVEPIFDRLLFF
WSDRRNPHEVQPSYATRYAMTVWYFDAEERAEAKKK, or a variant thereof, for use in the treatment or diagnosis
of a hypoxia-related disease or condition.

Neither this consensus domain nor any proteins that contain this domain have been previously associated
with the cellular response to hypoxia/ischaemia. Searches of the public databases indicate that the human
genome contains several genes that encode proteins that contain this consensus sequence. These proteins
may have similar functions or may function in the same biochemical pathway, potentially with an
antagonistic effect.

By "variant" is meant a variation of the consensus sequence given above that exhibits a degree of
homology with the consensus sequence above a certain threshold level of identity or similarity. Degrees
of identity and similarity can be readily calculated according to methods known in the art (see, for
example, Computational Molecular Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988;
Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993).
Typically, greater than 50% identity between two sequences is considered to be an indication of
functional equivalence. Preferably, a variant consensus according to this aspect of the invention exhibits a
degree of sequence identity with the consensus sequence given above of greater than 50%. More
preferred polypeptides have degrees of identity of greater than 60%, 70%, 80%, 90%, 95%, 98% or 99%,
respectively.
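
By way of illustration only, the identity threshold described above may be sketched as follows. The
sketch assumes that the two sequences have already been aligned to equal length by a method known in
the art; the function names percent_identity and is_variant, and the convention of counting a gap
opposite a residue as a mismatch, are assumptions made for this example and are not taken from the
cited references.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Columns in which both sequences carry a gap are ignored; a gap opposite
    a residue is counted as a mismatch (an assumption of this sketch).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue  # column carries no information
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def is_variant(consensus: str, candidate: str, threshold: float = 50.0) -> bool:
    """True if the candidate exceeds the stated identity threshold (default: greater than 50%)."""
    return percent_identity(consensus, candidate) > threshold


if __name__ == "__main__":
    # First ten residues of the consensus against a hypothetical candidate
    # carrying two substitutions.
    print(percent_identity("KAMVACYPGN", "KAMVGCYPGQ"))
```

The threshold argument may be raised to 60, 70, 80, 90, 95, 98 or 99 to reflect the more preferred
degrees of identity.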

As discussed in detail above, fragments and functional equivalents of these proteins are included within 
the present invention, in addition to ligands that bind specifically to these proteins. Furthermore, the 
invention also embraces purified and isolated nucleic acid molecules encoding these proteins, fragments 
and functional equivalents, vectors containing such nucleic acid molecules and host cells transformed
with these vectors. The therapeutic and diagnostic applications discussed above are also equally relevant
to this aspect of the invention.

The polypeptide referred to above as that encoded by SEQ ID No 91 is a specific protein that is termed
"Semaphorin 4b". The gene encoding this protein is regulated (activated) by conditions of hypoxia. The
Semaphorin 4b protein is encoded by a gene identified from the EST recited in SEQ ID No 92. The
unequivocal and accurate full length cDNA sequence is provided herein as SEQ ID No 92a. The accurate
presumptive amino acid sequence is provided herein as SEQ ID No 91. This protein, functionally-equivalent
variants of this protein, the encoding nucleic acid molecules and ligands that regulate the
activity and/or expression of this gene and protein are claimed above in the context of their role in
hypoxia and hypoxia-related disorders.

Semaphorins are a large family of proteins, characterised by the 500 amino acid sema domain (Puschel et
al., 1995, Neuron, 14(5): 941-8; Tamagnone and Comoglio, 2000, Trends Cell Biol., 10(9): 377-83).
Early work showed a role in the guidance of axons during brain development, and the regulation of cell
migration. More recently, specific members of this large family have been associated with cancer
(Brambilla et al., Am J Pathol., 2000, 156(3): 939-50), rheumatoid arthritis (Mangasser-Stephan et al.,
Biochem Biophys Res Commun., 1997, 234(1): 153-6), the immune system (Spriggs, Curr Opin
Immunol., 1999, 11(4): 387-91) including B-lymphocyte functions (Hall et al., Proc Natl Acad Sci USA,
1996, 93(21): 11780-5) and angiogenesis (Miao et al., J Cell Biol., 1999, 146(1): 233-42). This is perhaps
not surprising considering that cell migration / trafficking is a key part of inflammation, angiogenesis and 
tumour metastasis. 

There are at least 25 distinct human semaphorin genes and the significance/utility of many of these
remains untested. This includes the Semaphorin 4b protein, which is unpublished and until now has not
been assigned a full and accurate amino acid sequence. 

We have made experimental discoveries which link the expression of Semaphorin 4b to factors (hypoxia, 
gamma IFN and superoxide radicals) that are associated with a variety of human ischaemic and 
inflammatory diseases. In particular, a key response of cells to hypoxia is to stimulate angiogenesis, and a
key part of inflammation is the recruitment and trafficking of immune cells. In light of our discoveries,
and what is known about other specific members of the semaphorin family, it is herein proposed that
Semaphorin 4b is a regulator of these cellular functions, and thus provides a novel target for therapeutic
intervention. This paves the way for the development of therapeutic agents that either potentiate or
antagonise functions of Semaphorin 4b. Such agents are likely to be highly valuable in the treatment of
human disease.

The practice of the present invention will employ, unless otherwise indicated, conventional techniques of 
molecular biology, microbiology, recombinant DNA technology and immunology, which are within the 
skill of those working in the art.

Most general molecular biology, microbiology, recombinant DNA technology and immunological
techniques can be found in Sambrook et al., Molecular Cloning, A Laboratory Manual (1989) Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. or Ausubel et al., Current Protocols in Molecular
Biology (1990) John Wiley and Sons, N.Y.

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as
commonly understood by one of ordinary skill in the art to which this invention belongs.

A. Polypeptides 

The term "polypeptide" as used herein, refers to a chain (may be branched or unbranched) of two or more 
amino acids linked to each other by means of a peptide bond or modified peptide bond (isosteres). The 
term polypeptide encompasses but is not limited to oligopeptides, peptides and proteins. The polypeptide
of the invention may additionally be either in a mature protein form or in a pre-, pro- or prepro-protein
form that requires subsequent cleavage for formation of the active mature protein. The pre-, pro- or prepro-
part of the protein is often a leader or secretory sequence but may also be an additional sequence added to
aid protein purification (for example, a His tag) or to confer a higher stability on the protein.

A polypeptide according to the invention may also include modified amino acids, that is, amino acids
other than those 20 that are gene-encoded. This modification may be a result of natural processes such as
post-translational processing or by chemical modification. Examples of modifications include acetylation,
acylation, amidation, ADP-ribosylation, arginylation, attachment of a lipid derivative or
phosphatidylinositol, γ-carboxylation, covalent attachment of a flavin or haeme moiety, a nucleotide or
nucleotide derivative, cyclisation, demethylation, disulphide bond formation, formation of covalent
cross-links, formylation, glycosylation, GPI anchor formation, hydroxylation, iodination, lipid attachment,
methylation, myristoylation, oxidation, proteolytic processing, phosphorylation, prenylation,
racemisation, selenoylation, sulphation, and ubiquitination. Modification of the polypeptide can occur
anywhere within the molecule including the backbone, the amino acid side-chains or at the N- or
C-termini.

A polypeptide according to the invention may either be isolated from natural sources (for example, 
purified from cell culture), or be a recombinantly produced polypeptide, or a synthetically produced 
polypeptide or a combination of all the above. 

Antibodies 

A polypeptide according to the invention, its functional equivalents and/or any immunogenic fragments
derived from the polypeptide may be used to generate ligands including immunospecific monoclonal or
polyclonal antibodies, or antibody fragments. These antibodies can then be used to isolate or identify
clones expressing the polypeptide of the invention or to purify the polypeptide by affinity
chromatography. Further uses of these immunospecific antibodies may include, but are not limited to,
diagnostic, therapeutic or general assay applications. Examples of assay techniques that employ
antibodies are immunoassays, radioimmunoassays (RIA) or enzyme linked immunosorbent assay
(ELISA). In these cases, the antibodies may be labelled with an analytically-detectable reagent including
radioisotopes, a fluorescent molecule or any reporter molecule.

The term "immunospecific" as used herein refers to antibodies that have a substantially higher affinity for
a polypeptide of this invention compared with other polypeptides. The term "antibody" as used herein
refers to a molecule that is produced by animals in response to an antigen and has the particular property
of interacting specifically with the antigenic determinant that induced its formation. Fragments of the
aforementioned molecule such as Fab, F(ab')2 and scFv, which are capable of binding the antigen
determinant, are also included in the term "antibody". Antibodies may also be modified to make chimeric
antibodies, where non-human variable regions are joined or fused to human constant regions (for
example, Liu et al., PNAS, USA, 84, 3439 (1987)). Particularly, antibodies may be modified to make
them less immunogenic to an individual in a process such as humanisation (see, for example, Jones et al.,
Nature, 321, 522 (1986); Verhoeyen et al., Science, 239, 1534 (1988); Kabat et al., J. Immunol., 147,
1709 (1991); Queen et al., PNAS, USA, 86, 10029 (1989); Gorman et al., PNAS, USA, 88, 4181 (1991)
and Hodgson et al., Bio/Technology, 9, 421 (1991)). The term "humanised antibody", as used herein,
refers to antibody molecules in which the amino acids of the CDR (complementarity-determining region)
and selected other regions in the variable domains of the heavy and/or light chains of a non-human donor
antibody have been substituted with the equivalent amino acids of a human antibody. The humanised
antibody therefore closely resembles a human antibody, but has the binding ability of the donor antibody.
Antibodies may also have a "bispecific" nature, that is, the antibody has two different antigen binding
domains, each domain being directed against a different epitope.

Specific polyclonal antibodies may be made by immuno-challenging an animal with a polypeptide of this
invention. Common animals used for the production of antibodies include the mouse, rat, chicken, rabbit,
goat and horse. The polypeptide used to immuno-challenge the animal may be derived by recombinant
DNA technology or may be chemically-synthesised. In addition, the polypeptide may be conjugated to a
carrier protein. Commonly used carriers to which the polypeptides may be conjugated include, but are not
limited to, BSA (bovine serum albumin), thyroglobulin and keyhole limpet haemocyanin. Serum from the
immuno-challenged animal is collected and treated according to known procedures, for example, by
immunoaffinity chromatography.

Specific monoclonal antibodies can generally be made by methods known to one skilled in the art (see for
example, Kohler, G. and Milstein, C., Nature 256, 495-497 (1975); Kozbor et al., Immunology Today 4,
72 (1983); Cole et al., 77-96 in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc. (1985)
and Roitt, I. et al., Immunology, 25.10, Mosby-Year Book Europe Limited (1993)). Panels of monoclonal
antibodies produced against the polypeptides of the invention can be screened for various properties, i.e.,
for isotype, epitope, affinity, etc. against which they are directed. Alternatively, genes encoding the
monoclonal antibodies of interest may be isolated from hybridomas, for instance using PCR techniques
known in the art, and cloned and expressed in appropriate vectors.

Phage display technology may be utilised to select the genes encoding the antibodies that have exhibited
an immunospecific response to the polypeptides of the invention (see McCafferty, J., et al., (1990), Nature
348, 552-554; Marks, J. et al., (1992) Biotechnology 10, 779-783).

Ligands 

The polypeptides of the invention may also be used to search for interacting ligands. Methods for doing
this include the screening of a library of compounds (see Coligan et al., Current Protocols in Immunology
1(2); Chapter 5 (1991)), isolating the ligands from cells, and isolating the ligands from a cell-free preparation
or natural product mixtures. Ligands to the polypeptide may activate (agonise) or inhibit (antagonise) its
activity. Alternatively, compounds may affect the levels of the polypeptide present in the cell, including
affecting gene expression, mRNA stability and the degree of post-translational modification of the
encoded protein. The invention thus embraces methods for the identification of a compound that is
effective in the treatment and/or diagnosis of disease, comprising contacting a polypeptide, a nucleic acid
molecule or host cell according to any one of the embodiments of the invention described herein with one
or more compounds suspected of possessing binding affinity for said polypeptide or nucleic acid
molecule, and selecting a compound that binds specifically to said nucleic acid molecule or polypeptide,
or that affects the level of gene expression, mRNA stability or the degree of post-translational
modification of the encoded protein.

Ligands to the polypeptide form a further aspect of the invention, as discussed in more detail above.
Preferred "antagonist" ligands include those that bind to the polypeptide of this invention and strongly
inhibit any activity of the polypeptide. Preferred "agonist" ligands include those that bind to the
polypeptide and strongly induce activity of the polypeptide of this invention or increase substantially the
level of the polypeptide in the cell. As defined above, the term "agonist" is meant to include any
polypeptide, peptide, synthetic molecule or organic molecule that functions as an activator, by increasing
the effective biological activity of a polypeptide, for example, by increasing gene expression or enzymatic
activity. The term "antagonist" is meant to include any polypeptide, peptide, synthetic molecule or
organic molecule that functions as an inhibitor, by decreasing the effective biological activity of the gene
product, for example, by inhibiting gene expression of an enzyme or a pharmacological receptor.

Ligands to a polypeptide according to the invention may come in various forms, including natural or
modified substrates, enzymes, receptors, small organic molecules such as small natural or synthetic
organic molecules of up to 2000Da, preferably 800Da or less, peptidomimetics, inorganic molecules,
peptides, polypeptides, antibodies, and structural or functional mimetics of the aforementioned.

B. Nucleic acid molecules 

Preferred nucleic acid molecules of the invention are those which encode the polypeptide sequences
recited in any one of SEQ ID Nos. 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39,
41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 63, 67, 69, 73, 75, 77, 85, 87, 89, 91, 93, 95, 99, 103, 113, 115, 119,
121, 129, 131, 133, 137, 139, 141, 145, 151, 153, 157, 159, 163, 169, 181, 187, 201, 205, 207 and 209.
Examples of such nucleic acid molecules include those listed in SEQ ID Nos. 2, 4, 6, 8, 10, 12, 14, 16, 18,
20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74,
76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120,
122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162,
164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204,
206, 208, 210, 212, 214 and 216, homologous nucleic acids and nucleic acids that are complementary to
these nucleic acid molecules. Nucleic acid molecules of this aspect of the invention may be used in
numerous methods and applications, as described generally herein. A nucleic acid molecule preferably
comprises at least n consecutive nucleotides from any one of the sequences disclosed in SEQ ID Nos.
2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58,
60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108,
110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150,
152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192,
194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214 and 216, where n is 10 or more. A nucleic acid
molecule of the invention also includes sequences that are complementary to the nucleic acid molecule
described above (for example, for antisense or probing purposes).
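
Purely as an illustrative sketch, the two notions used above (a run of at least n consecutive
nucleotides, and the complementary strand) may be expressed as follows. The helper names
reverse_complement, fragments and shares_fragment, and the short reference sequence used in the usage
example, are hypothetical and are not sequences of the invention.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def fragments(seq: str, n: int = 10) -> list[str]:
    """All windows of n consecutive nucleotides in seq (empty if seq is shorter than n)."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    return [seq[i:i + n] for i in range(len(seq) - n + 1)]


def shares_fragment(candidate: str, reference: str, n: int = 10) -> bool:
    """True if candidate contains at least n consecutive nucleotides of the
    reference sequence or of its complementary strand."""
    ref = reference.upper()
    windows = set(fragments(ref, n)) | set(fragments(reverse_complement(ref), n))
    return any(w in windows for w in fragments(candidate.upper(), n))


if __name__ == "__main__":
    reference = "ATGCCGTAAGCTTGCA"               # hypothetical 16-nt reference
    probe = reverse_complement(reference[2:14])  # 12-nt antisense fragment
    print(shares_fragment(probe, reference))     # compares against both strands
```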

A nucleic acid molecule according to this aspect of the invention may be in the form of RNA, such as
mRNA, or DNA, such as cDNA, synthetic DNA or genomic DNA. The nucleic acid molecule may be
double-stranded or single-stranded. The single-stranded form may be the coding (sense) strand or the
non-coding (antisense) strand. A nucleic acid molecule may also comprise an analogue of DNA or RNA,
including, but not limited to, modifications made to the backbone of the molecule, such as, for example, a
peptide nucleic acid (PNA). The term "PNA" as used herein refers to an antisense molecule that
comprises an oligonucleotide of at least five nucleotides in length linked to a peptide backbone of amino
acid residues, preferably ending in lysine. The terminal lysine confers solubility to the composition. PNAs
may be pegylated to extend their lifespan in a cell, where they preferentially bind complementary
single-stranded DNA and RNA and stop transcript elongation (Nielsen, P.E. et al. (1993) Anticancer Drug Des.
8:53-63).

A nucleic acid molecule according to this aspect of the invention can be isolated by cloning, purification 
or separation of the molecule directly from a particular organism, or from a library, such as a genomic or 
cDNA library. The molecule may also be synthesised, for example, using chemical synthetic techniques 
such as solid phase phosphoramidite chemical synthesis. RNA may be synthesised in vitro or in vivo by
transcription of the relevant DNA molecule.

Due to the degeneracy of the genetic code, differing nucleic acid sequences may encode the same
polypeptide (or mature polypeptide). Thus, nucleic acid molecules included in this aspect of the invention
include any molecule comprising a variant of the sequence explicitly recited. Such variants may include
variant nucleic acid molecules that code for the same polypeptide (or mature polypeptide) as that
explicitly identified, that code for a fragment of the polypeptide, that code for a functional equivalent of
the polypeptide or that code for a fragment of the functional equivalent of the polypeptide. Also included
in this aspect of the invention are variant nucleic acid molecules that are derived from nucleotide
substitutions, deletions, rearrangements or insertions or multiple combinations of the aforementioned.
Such molecules may be naturally occurring variants, such as allelic variants, non-naturally occurring
variants such as those created by chemical mutagenesis, or variants isolated from a species, cell or
organism type other than the type from which the sequence explicitly identified originated. Variant
nucleic acid molecules may differ from the nucleic acid molecule explicitly recited in a coding region,
non-coding region or both these regions.
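
The degeneracy referred to above may be illustrated by the following sketch, which translates two
differing DNA sequences using the standard genetic code and confirms that they encode the same
polypeptide. The two nine-nucleotide sequences are hypothetical examples chosen for brevity, and the
function names translate and encode_same_polypeptide are assumptions of this sketch.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order at each
# position and the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


def encode_same_polypeptide(dna_a: str, dna_b: str) -> bool:
    """True if two nucleic acid sequences encode an identical polypeptide."""
    return translate(dna_a) == translate(dna_b)


if __name__ == "__main__":
    original = "ATGGCTAAA"  # hypothetical sequence: Met-Ala-Lys
    variant = "ATGGCCAAG"   # differs at two third-codon positions
    print(translate(original), translate(variant))
    print(encode_same_polypeptide(original, variant))
```

In this example the two sequences differ at two of nine nucleotide positions yet are silent at the
polypeptide level.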

Nucleic acid molecules may also include additional nucleic acid sequence to that explicitly recited, for
example, at the 5' or 3' end of the molecule. Such additional nucleic acids may encode a polypeptide
with added functionality compared with the original polypeptide whose sequence is explicitly identified
herein. An example of this would be an addition of a sequence that is heterologous to the original nucleic
acid sequence, to encode a fusion protein. Such a fusion protein may be of use in aiding purification
procedures or enabling techniques to be carried out where fusion proteins are required (such as in the
yeast two hybrid system). Additional sequences may also include leader or secretory sequences such as
those coding for pro-, pre- or prepro- polypeptide sequences. These additional sequences may also
include non-coding sequences that are transcribed but not translated including ribosome binding sites and
termination signals.

A nucleic acid molecule of the invention may include molecules that are at least 70% identical over their
entire length to a nucleic acid molecule as explicitly identified herein in SEQ ID Nos.: 2, 4, 6, 8, 10, 12,
14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68,
70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116,
118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158,
160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200,
202, 204, 206, 208, 210, 212, 214 and 216. Preferably, a nucleic acid molecule according to this aspect of
the invention comprises a region that is at least 80% identical over its entire length to a nucleic acid
molecule as explicitly identified herein in these SEQ ID Nos., preferably at least 90%, more preferably at
least 95% and most preferably at least 98% or 99% identical. Further preferred embodiments include
nucleic acid molecules that encode polypeptides that retain substantially the same biological function or
activity as the polypeptide explicitly identified herein. The terms "homology" and "identity" should be
given the meanings described in detail above with respect to polypeptide analysis. Preferably, nucleotide
homology and identity are assessed using the blastn program available at http://www.ncbi.nlm.nih.gov.

The nucleic acid molecules of the invention can also be engineered using methods generally known in the
art. These methods include but are not limited to DNA shuffling; random or non-random fragmentation
(by restriction enzymes or shearing methods) and reassembly of fragments; insertions, deletions,
substitutions and rearrangements of sequences by site-directed mutagenesis (for example, by PCR). These
alterations may be for a number of reasons including for ease of cloning (such as introduction of new
restriction sites), altering of glycosylation patterns, changing of codon preferences, producing splice variants,
changing the processing and/or expression of the gene product (the polypeptide) in general, or creating
fusion proteins (see above).

Hybridisation 

Nucleic acid molecules of the invention may also include antisense molecules that are partially
complementary to a nucleic acid molecule as explicitly identified herein in SEQ ID Nos.: 2, 4, 6, 8, 10,
12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66,
68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114,
116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156,
158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198,
200, 202, 204, 206, 208, 210, 212, 214 and 216, and which therefore will hybridise to the encoding
nucleic acid molecules. These antisense molecules, including oligonucleotides, can be designed to
recognise, specifically bind to and prevent transcription of a target nucleic acid encoding a polypeptide of
the invention, as will be known by those of ordinary skill in the art (see Cohen, J.S., Trends in Pharm.
Sci., 10, 435 (1989); Okano, J. Neurochem. 56, 560 (1991); O'Connor, J. Neurochem. 56, 560 (1991); Lee
et al., Nucleic Acids Res. 6, 3073 (1979); Cooney et al., Science 241, 456 (1988); Dervan et al., Science
251, 1360 (1991)).

The term "hybridisation" used herein refers to any process by which a strand of nucleic acid binds with a 
complementary strand of nucleic acid by hydrogen bonding, typically forming Watson-Crick base pairs. 
As carried out in vitro, one of the nucleic acid populations is usually immobilised to a surface, whilst the
other population is free. The two molecule types are then placed together under conditions conducive to 
binding. 

The phrase "stringency of hybridisation" refers to the percentage of complementarity that is needed for
duplex formation. "Stringency" thus refers to the conditions in a hybridisation reaction that favour the
association of very similar molecules over association of molecules that differ. Conditions can therefore
exist that allow not only nucleic acid strands with 99-100% complementarity to hybridise, but sequences
with lower complementarity (for example, 50%) to also hybridise. High stringency hybridisation
conditions are defined herein as overnight incubation at 42°C in a solution comprising 50% formamide,
5X SSC (150mM NaCl, 15mM trisodium citrate), 50mM sodium phosphate (pH7.6), 5X Denhardt's
solution, 10% dextran sulphate, and 20 microgram/ml denatured, sheared salmon sperm DNA, followed
by washing the filters in 0.1X SSC at approximately 65°C. Low stringency conditions involve the
hybridisation reaction being carried out at 35°C (see Sambrook et al. [supra]). Preferably, the conditions
used for hybridisation are those of high stringency.

Some trans- and cis-acting factors that may affect the binding of two complementary strands include
strand length, base composition (GC pairs have an extra hydrogen bond and thus require more energy
to separate than AT pairs) and the chemical environment. The presence of monovalent cations (such as
Na+) stabilises duplex formation whereas chemical denaturants such as formamide and urea destabilise
the duplex by disruption of the hydrogen bonds. Use of compounds such as polyethylene glycol (PEG)
can increase reassociation speeds by increasing the effective DNA concentration in aqueous solution by
abstracting water molecules. Denhardt's reagent or BLOTTO are chemical agents often added to block
non-specific attachment of the liquid phase to the solid support. Increasing the temperature will also
increase the stringency of hybridisation, as will increasing the stringency of the washing conditions
following hybridisation (Sambrook et al. [supra]).
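
The qualitative effects described above (strand length, GC content, monovalent cation concentration and
formamide) may be illustrated numerically. The sketch below uses a widely quoted empirical
approximation for the melting temperature of long DNA-DNA hybrids, Tm = 81.5 + 16.6 log10[Na+] +
0.41(%GC) - 0.61(% formamide) - 500/L. This formula is not recited in the present specification and is
introduced here only as an assumption for illustration; it is a rough guide, not a substitute for
empirical optimisation. The function names gc_percent and estimate_tm are likewise hypothetical.

```python
import math


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleic acid sequence."""
    s = seq.upper()
    if not s:
        raise ValueError("sequence must not be empty")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def estimate_tm(gc_pct: float, length: int, na_molar: float,
                formamide_pct: float = 0.0) -> float:
    """Approximate melting temperature (degrees C) of a long DNA-DNA hybrid.

    Empirical approximation assumed for this sketch:
    Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(%GC) - 0.61*(%formamide) - 500/L
    """
    if length <= 0 or na_molar <= 0:
        raise ValueError("length and cation concentration must be positive")
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_pct
            - 0.61 * formamide_pct
            - 500.0 / length)


if __name__ == "__main__":
    # A hypothetical 500-bp hybrid with 50% GC in 1 M monovalent cation,
    # without and with 50% formamide.
    print(estimate_tm(50.0, 500, 1.0))
    print(estimate_tm(50.0, 500, 1.0, formamide_pct=50.0))
```

Under this approximation, formamide lowers the estimated melting temperature, whereas a higher cation
concentration, a higher GC content or a longer duplex raises it, in keeping with the factors listed above.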

Numerous techniques exist for effecting hybridisation of nucleic acid molecules. Such techniques usually
involve one of the nucleic acid populations being labelled. Labelling methods include, but are not limited
to, radiolabelling, fluorescence labelling, chemiluminescent or chromogenic labelling or chemically
coupling a modified reporter molecule to a nucleotide precursor such as the biotin-streptavidin system.
This can be done by oligolabelling, nick-translation, end-labelling or PCR amplification using a labelled
polynucleotide. Labelling of RNA molecules can be achieved by cloning the sequences encoding the
polypeptide of the invention into a vector specifically for this purpose. Such vectors are known in the art
and may be used to synthesise RNA probes in vitro by the addition of an appropriate RNA polymerase
such as T7, T3 or SP6 and labelled nucleotides.

Various kits are commercially available that allow the labelling of molecules. Examples include those 
made by Pharmacia & Upjohn (Kalamazoo, MI); Promega (Madison, WI); and the U.S. Biochemical
Corp. (Cleveland, OH). Hybridisation assays include, but are not limited to, dot-blots, Southern blotting,
Northern blotting, chromosome in situ hybridisation (for example, FISH [fluorescence in situ
hybridisation]), tissue in situ hybridisation, colony blots, plaque lifts, gridded clone hybridisation assays,
DNA microarrays and oligonucleotide microarrays. These hybridisation methods and others, may be used 
by a skilled artisan to isolate copies of genomic DNA, cDNA, or RNA encoding homologous or 
orthologous proteins from other species. 

The invention therefore also embodies a process for detecting a nucleic acid molecule according to the 
invention, comprising the steps of: (a) contacting a nucleic acid probe with a biological sample under
hybridising conditions to form duplexes; and (b) detecting any such duplexes that are formed. The term
"probe" as used herein refers to a nucleic acid molecule in a hybridisation reaction whose molecular 
identity is known and is designed specifically to identify nucleic acids encoding homologous genes in 
other species. Usually, the probe population is the labelled population, but this is not always the case, as,
for example, in a reverse hybridisation assay.

One example of a use of a probe is to find nucleic acid molecules with an equivalent function to those that 
are explicitly identified herein, or to identify additional family members in the same or other species. This 
can be done by probing libraries, such as genomic or cDNA libraries, derived from a source of interest, 
such as a human, a non-human animal, other eukaryote species, a plant, a prokaryotic species or a virus.
The probe may be natural or artificially designed using methods recognised in the art (for example,
Ausubel et al. [supra]). A nucleic acid probe will preferably possess greater than 15, more preferably
greater than 30 and most preferably greater than 50 contiguous bases complementary to a nucleic acid 
molecule explicitly identified herein. 
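
By way of illustration only, the preferred probe lengths stated above can be checked computationally. The following Python sketch is not part of the described method; the function names are hypothetical and it assumes unambiguous DNA sequences written 5' to 3' using only A, C, G and T. It finds the longest contiguous stretch of a probe that is complementary to a target and reports which of the stated thresholds (greater than 15, 30 or 50 bases) that stretch satisfies.

```python
# Illustrative sketch only: names and sequence conventions are assumptions.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def longest_complementary_run(probe, target):
    """Length of the longest contiguous stretch of probe that can pair with target.

    A probe pairs with the target where the probe's reverse complement
    matches the target, so this is a longest-common-substring search.
    """
    rc = reverse_complement(probe)
    target = target.upper()
    best = 0
    prev = [0] * (len(target) + 1)
    for i in range(1, len(rc) + 1):
        curr = [0] * (len(target) + 1)
        for j in range(1, len(target) + 1):
            if rc[i - 1] == target[j - 1]:
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best


def probe_tier(probe, target):
    """Classify a probe against the thresholds given in the text."""
    n = longest_complementary_run(probe, target)
    if n > 50:
        return "most preferred"
    if n > 30:
        return "more preferred"
    if n > 15:
        return "preferred"
    return "below preferred length"
```

For example, a 20-base probe that is fully complementary to part of a target would be classified as "preferred", whereas a 4-base probe would fall below the preferred length.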

In many cases, isolated DNA from cDNA libraries will be incomplete in the region encoding the 
polypeptide, normally at the 5' end. Methods available for subsequently obtaining full-length cDNA
sequence include RACE (rapid amplification of cDNA ends) as described by Frohman et al. (Proc. Natl
Acad. Sci. USA 85, 8998-9002 (1988)), and restriction-site PCR, which uses universal primers to retrieve
unknown nucleic acid sequence adjacent to a known locus (Sarkar, G. (1993) PCR Methods Applic.,
2:318-322). "Inverse PCR" may also be used to amplify or to extend sequences using divergent primers
based on a known region (Triglia, T. et al. (1988) Nucleic Acids Res. 16:8186). Another method which
may be used is "capture PCR", which involves PCR amplification of DNA fragments adjacent to a known
sequence in human and yeast artificial chromosome DNA (Lagerstrom, M. et al. (1991) PCR Methods
Applic. 1:111-119). Another method which may be used to retrieve unknown sequences is that of Parker,
J.D. et al. ((1991) Nucleic Acids Res. 19:3055-3060). Additionally, one may use PCR, nested primers,
and libraries, such as the PromoterFinder™ library (Clontech, Palo Alto, CA) to walk genomic DNA.
This latter process avoids the need to screen libraries and is useful in finding intron/exon junctions.

When screening for full-length cDNAs, it is preferable to use libraries that have been size-selected to
include larger cDNAs. Also, random-primed libraries are preferable, in that they will contain more
sequences that contain the 5' regions of genes. Use of a randomly primed library may be especially
preferable for situations in which an oligo d(T) library does not yield a full-length cDNA. Genomic 
libraries may be useful for extension of sequence into 5' non-transcribed regulatory regions. 

In one embodiment of the invention, a nucleic acid molecule according to the invention may be used for
chromosome localisation. In this technique, a nucleic acid molecule is specifically targeted to, and can
hybridise with, a particular location on an individual human chromosome. The mapping of relevant
sequences to chromosomes is an important step in the confirmatory correlation of those sequences with
the gene-associated disease. Once a sequence has been mapped to a precise chromosomal location, the
physical position of the sequence on the chromosome can be correlated with genetic map data. Such data
are found in, for example, McKusick, Mendelian Inheritance in Man (available on-line through Johns
Hopkins University Welch Medical Library). The relationships between genes and diseases that have
been mapped to the same chromosomal region are then identified through linkage analysis (coinheritance
of physically adjacent genes). This provides valuable information to investigators searching for disease
genes using positional cloning or other gene discovery techniques. Once the disease or syndrome has
been crudely localised by genetic linkage to a particular genomic region, any sequences mapping to that 
area may represent associated or regulatory genes for further investigation. The nucleic acid molecule 
may also be used to detect differences in the chromosomal location due to translocation, inversion, etc. 
among normal, carrier, or affected individuals. 

Nucleic acid molecules of the present invention are also valuable for tissue localisation. Such techniques
facilitate the determination of expression patterns of the polypeptide in tissues by detection of the mRNAs
that encode them. These techniques include in situ hybridisation techniques and nucleotide amplification
techniques, such as PCR. Results from these studies provide an indication of the normal functions of the
polypeptide in the organism, as well as highlighting the involvement of a particular gene in a disease state
or abnormal physiological condition.

In addition, comparative studies of the normal expression pattern of mRNAs with that of mRNAs 
encoded by a mutant gene provide valuable insights into the role of mutant polypeptides in disease. Such 
inappropriate expression may be of a temporal, spatial or quantitative nature.

Vectors 

The nucleic acid molecules of the present invention may be incorporated into vectors for cloning (for 
example, pBluescript made by Stratagene) or expression purposes. Vectors containing a nucleic acid 
molecule explicitly identified herein (or a variant thereof) form another aspect of this invention. The 
nucleic acid molecule may be inserted into an appropriate vector by any of a variety of well known techniques
such as those described in Sambrook et al. [supra]. Generally, the encoding gene can be placed under the
control of a control element such as a promoter, ribosome binding site or operator, so that the DNA 
sequence encoding the desired polypeptide is transcribed into RNA in the transformed host cell. 

Vectors may be derived from various sources including, but not limited to, bacterial plasmids,
bacteriophage, transposons, yeast episomes, insertion elements, yeast chromosomal elements, viruses (for
example, baculoviruses, SV40 (simian virus 40), vaccinia viruses, adenoviruses, fowl pox viruses,
pseudorabies viruses, lentiviruses and retroviruses), or combinations thereof, such as those derived from
plasmid and bacteriophage genetic elements, including cosmids and phagemids. Human, bacterial and
yeast artificial chromosomes (HACs, BACs and YACs respectively) may also be employed to deliver
larger fragments of DNA than can be contained and expressed in a plasmid.

Examples of retroviruses include but are not limited to: murine leukaemia virus (MLV), human 
immunodeficiency virus (HIV), equine infectious anaemia virus (EIAV), mouse mammary tumour virus 
(MMTV), Rous sarcoma virus (RSV), Fujinami sarcoma virus (FuSV), Moloney murine leukaemia virus 
(Mo-MLV), FBR murine osteosarcoma virus (FBR MSV), Moloney murine sarcoma virus (Mo-MSV), 
Abelson murine leukaemia virus (A-MLV), Avian myelocytomatosis virus-29 (MC29), and Avian
erythroblastosis virus (AEV). A detailed list of retroviruses may be found in Coffin et al ("Retroviruses" 
1997 Cold Spring Harbour Laboratory Press Eds: JM Coffin, SM Hughes, HE Varmus pp 758-763). 

Lentiviruses can be divided into primate and non-primate groups. Examples of primate lentiviruses 
include but are not limited to: the human immunodeficiency virus (HIV), the causative agent of human
acquired immunodeficiency syndrome (AIDS), and the simian immunodeficiency virus (SIV). The
non-primate lentiviral group includes the prototype "slow virus" visna/maedi virus (VMV), as well as the
related caprine arthritis-encephalitis virus (CAEV), equine infectious anaemia virus (EIAV) and the more 
recently described feline immunodeficiency virus (FIV) and bovine immunodeficiency virus (BIV).

A distinction between the lentivirus family and other types of retroviruses is that lentiviruses have the
capability to infect both dividing and non-dividing cells (Lewis et al. 1992 EMBO J. 11: 3053-3058;
Lewis and Emerman 1994 J. Virol. 68: 510-516). In contrast, other retroviruses, such as MLV, are
unable to infect non-dividing cells such as those that make up, for example, muscle, brain, lung and liver
tissue.

A vector may be configured as a split-intron vector. A split-intron vector is described in PCT patent
applications WO 99/15683 and WO 99/15684.

If the features of adenoviruses are combined with the genetic stability of retroviruses/lentiviruses then 
essentially the adenovirus can be used to transduce target cells to become transient retroviral producer 
cells that could stably infect neighbouring cells. Such retroviral producer cells engineered to express an
antigen of the present invention can be implanted in organisms such as animals or humans for use in the
treatment of angiogenesis and/or cancer.

Poxvirus vectors are also suitable for use in accordance with the present invention. Pox viruses are 
engineered for recombinant gene expression and for use as recombinant live vaccines. This entails the
use of recombinant techniques to introduce nucleic acids encoding foreign antigens into the genome of
the pox virus. If the nucleic acid is integrated at a site in the viral DNA which is non-essential for the life
cycle of the virus, it is possible for the newly produced recombinant pox virus to be infectious, that is to
say to infect foreign cells and thus to express the integrated DNA sequence. Recombinant pox viruses
prepared in this way can be used as live vaccines for the prophylaxis and/or treatment of pathologic and
infectious disease.

For vaccine delivery, preferred vectors are vaccinia virus vectors such as MVA or NYVAC. Most 
preferred is the vaccinia strain modified vaccinia virus Ankara (MVA) or a strain derived therefrom. Alternatives
to vaccinia vectors include avipox vectors such as fowlpox or canarypox (known as ALVAC) and strains
derived therefrom, which can infect and express recombinant proteins in human cells but are unable to
replicate.

Bacterial vectors may also be used, such as salmonella, listeria and mycobacteria.

Vectors containing the relevant nucleotide sequence may enter the host cell by a variety of methods well 
known in the art and described in many standard laboratory manuals (such as Sambrook et al. [supra],
Ausubel et al. [supra], Davis et al., Basic Methods in Molecular Biology (1986)). Methods include
calcium phosphate transfection, cationic lipid-mediated transfection, DEAE-dextran mediated
transfection, electroporation, microinjection, scrape loading, transduction, and ballistic introduction or
infection.

Host cells

The choice of host cells is often dependent on the vector type used as a carrier for the nucleic acid
molecule of the present invention. Bacteria and other microorganisms are particularly suitable hosts for
plasmids, cosmids and expression vectors generally (for example, vectors derived from the pBR322
plasmid), yeast are suitable hosts for yeast expression vectors, insect cell systems are suitable hosts for
virus expression vectors (for example, baculovirus) and plant cells are suitable hosts for vectors such as
the cauliflower mosaic virus (CaMV) and tobacco mosaic virus (TMV). Other expression systems include
using animal cells (for example, with the LentiVectors™, Oxford BioMedica) as a host cell or even using
cell-free translating systems. Some vectors, such as "shuttle vectors", may be maintained in a variety of
host cells. An example of such a vector would be pEG 202 and other yeast two-hybrid vectors which can
be maintained in both yeast and bacterial cells (see Ausubel et al. [supra] and Gyuris, J., Cell, 75,
791-803).

Examples of suitable bacterial hosts include Streptococci, Staphylococci, Escherichia coli, Streptomyces
and Bacillus subtilis cells. Yeast and fungal hosts include Saccharomyces cerevisiae and Aspergillus
cells. Mammalian cell hosts include many immortalised cell lines available from the American Type
Culture Collection (ATCC) such as CHO (Chinese Hamster Ovary) cells, HeLa cells, BHK (baby hamster
kidney) cells, monkey kidney cells, C127, 3T3, HEK 293, Bowes melanoma and human
hepatocellular carcinoma (for example, Hep G2) cells. Insect host cells that are used for baculovirus
expression include Drosophila S2 and Spodoptera Sf9 cells. Plant host cells include most plants from
which protoplasts can be isolated and cultured to give whole regenerated plants. Practically all plants can be
regenerated from cultured cells or tissues, including but not limited to all major species of sugar cane,
sugar beet, cotton, fruit and other trees, legumes and vegetables.

Expression systems 

Also included in the present invention are expression vectors that comprise a nucleic acid molecule as
described above. Expression vectors and host cells are preferably chosen to give long term, high yield 
production and stable expression of the recombinant polypeptide and its variants. 

Expression of a polypeptide can be effected by cloning an encoding nucleic acid molecule into a suitable 
expression vector and inserting this vector into a suitable host cell. The positioning and orientation of the 
nucleic acid molecule insert with respect to the regulatory sequences of the vector is important to ensure
that the coding sequence is properly transcribed and translated. Alternatively, control and other regulatory
sequences may be ligated onto the nucleic acid molecule of this invention prior to its insertion into the
expression vector. In both cases, the sequence of the nucleic acid molecule may have to be adjusted in
order to effect correct transcription and translation (for example, addition of nucleotides may be necessary
to obtain the correct reading frame for translation of the polypeptide from its encoding nucleic acid
molecule).
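
The reading-frame adjustment mentioned above can be illustrated with a short calculation. The following Python sketch is a simplified illustration under stated assumptions, not a cloning protocol: the function names and the choice of filler base are hypothetical, and it assumes that translation starts at the first base of the upstream vector sequence. It computes how many nucleotides (0, 1 or 2) would need to be added so that the insert begins on a codon boundary.

```python
# Illustrative sketch only: names and the filler base are assumptions.

def padding_for_frame(upstream_len):
    """Number of nucleotides (0, 1 or 2) to add before the insert.

    upstream_len is the number of bases from the first base of the
    start codon up to the insertion point. The insert is in frame
    when upstream_len plus the padding is a multiple of three.
    """
    return (-upstream_len) % 3


def build_in_frame_fusion(upstream, insert, filler="G"):
    """Join upstream sequence and insert, padding so the insert is in frame.

    The filler base is arbitrary here; in practice it would be chosen
    so that no stop codon or unwanted residue is introduced.
    """
    pad = padding_for_frame(len(upstream))
    return upstream + filler * pad + insert
```

For example, with ten bases of upstream sequence, two nucleotides would be added so that the insert starts at position 12, a multiple of three.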

A nucleic acid molecule of the invention may comprise control sequences that encode signal peptides or 
leader sequences. These sequences may be useful in directing the translated polypeptide to a variety of 
locations within or outside the host cell, such as to the lumen of the endoplasmic reticulum, to the 
nucleus, to the periplasmic space, or into the extracellular environment. Such signals may be endogenous 
to the nucleic acid molecules of the invention, or may be a heterologous sequence. These leader or control
sequences may be removed by the host during post-translational processing. 

A nucleic acid molecule of the present invention may also comprise one or more regulatory sequences 
that allow for regulation of the expression of polypeptide relative to the growth of the host cell. 
Alternatively, these regulatory signals may be due to a heterologous sequence from the vector. Stimuli
that these sequences respond to include those of a physical or chemical nature such as the presence or
absence of regulatory compounds, changing temperatures or metabolic conditions. Regulatory sequences,
as described herein, are non-translated regions of sequence such as enhancers, promoters and the 5' and
3' untranslated regions of genes. Regulatory sequences interact with host cellular proteins that carry out
translation and transcription. These regulatory sequences may vary in strength and specificity. Examples
of regulatory sequences include those of constitutive and inducible promoters. In bacterial systems, an
example of an inducible promoter is the hybrid lacZ promoter of the Bluescript phagemid (Stratagene,
La Jolla, CA) or pSport1™ plasmid (Gibco BRL). The baculovirus polyhedrin promoter may be used in
insect cells. 

An example of a preferred expression system is the lentivirus expression system, for example, as 
described in International patent application WO 98/17815.

Detection of uptake of vectors by the host organism 

Various methods are known in the art to detect the uptake of a nucleic acid or vector molecule by a host 
cell and/or the subsequent successful expression of the encoded polypeptide (see for example Sambrook 
et al, [supra]). 

Vectors frequently have marker genes that can be easily assayed. Thus, vector uptake by a host cell can
be readily detected by testing for the relevant phenotype. Markers include, but are not limited to, those
coding for antibiotic resistance, herbicide resistance or nutritional requirements. The gene encoding
dihydrofolate reductase (DHFR), for example, confers resistance to methotrexate (Wigler, M. et al. (1980)
PNAS 77:3567-70) and the gene npt confers resistance to the aminoglycosides neomycin and G-418
(Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14). Additional selectable genes have been
described, examples of which will be clear to those of skill in the art. 

Markers, however, only indicate that a vector has been taken up by a host cell; they do not distinguish
between vectors that contain the desired nucleic acid molecule and those that do not. One method of
detecting the said nucleic acid molecule is to insert the relevant sequence at a position that will disrupt
the transcription and translation of a marker gene. These cells can then be identified by the absence of a
marker gene phenotype. Alternatively, a marker gene can be placed in tandem with a sequence encoding a
polypeptide of the invention under the control of a single promoter. Expression of the marker gene in
response to induction or selection usually indicates expression of the tandem gene as well. 

More direct and definitive methods to detect the presence of the nucleic acid molecule of the present 
invention include DNA-DNA or DNA-RNA hybridisation with a probe comprising the relevant antisense 
molecule, as described above. More direct methods to detect polypeptide expression include protein
bioassays, for example, fluorescence activated cell sorting (FACS), or immunoassay techniques such as
ELISA or radioimmunoassays.

Alternative methods for detecting or quantitating the presence of the nucleic acid molecule or polypeptide 
of this invention include membrane, solution or chip-based technologies (see Hampton, R. et al. (1990)
Serological Methods, a Laboratory Manual, APS Press, St Paul, MN; and Maddox, D.E. et al. (1983) J.
Exp. Med. 158, 1211-1216).

Transgenic animals 

In another embodiment of this invention, a nucleic acid molecule according to the invention may be used 
to create a transgenic animal, most commonly a rodent. The modification of the animal's genome may 
either be done locally, by modification of somatic cells or by germ line therapy to incorporate inheritable 
modifications. Such transgenic animals may be particularly useful in the generation of animal models for
drug molecules effective as modulators of the polypeptides of the present invention. 

Polypeptide purification 

A polypeptide according to the invention may be recovered and purified from recombinant cell cultures 
by methods including, but not limited to cell lysis techniques, ammonium sulphate precipitation, ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose chromatography,
hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite chromatography,
lectin chromatography, high performance liquid chromatography (HPLC) or fast protein liquid
chromatography (FPLC). The polypeptide may need refolding after purification or isolation and many
well known techniques are available that will help regenerate an active polypeptide conformation. 

Many expression vectors are commercially available that aid purification of the relevant polypeptide. 
These include vectors that join the sequence encoding the polypeptide to another expressed sequence
creating a fused protein that is easier to purify. Ways in which these fused parts can facilitate purification 
of the polypeptide of this invention include fusions that can increase the solubility of the polypeptide, 
joining of metal chelating peptides (for example, histidine-tryptophan modules) that allow for purification 
with immobilised metals, joining of protein A domains which allow for purification with immobilised 
immunoglobulins and the joining of the domain that is utilised in the FLAGS extension/affinity
purification system (Immunex Corp., Seattle, WA). Fusion of the polypeptide of this present invention
with a secretion signal polypeptide may also aid purification. This is because the medium into which the 
fused polypeptide has been secreted can subsequently be used to recover and purify the expressed 
polypeptide. 

These extraneous polypeptides often comprise a cleavable linker sequence which allows the
polypeptide to be isolated from the fusion if necessary. Cleavable linker sequences between the purification domain
and the polypeptide of the invention include those specific for Factor Xa or for enterokinase (Invitrogen, 
San Diego, CA). One such expression vector provides for expression of a fusion protein containing the 
polypeptide of the invention fused to several histidine residues preceding a thioredoxin or an enterokinase
cleavage site. The histidine residues facilitate purification by IMAC (immobilised metal ion affinity
chromatography as described in Porath, J. et al (1992), Prot. Exp. Purif. 3: 263-281), while the 
thioredoxin or enterokinase cleavage site provides a means for purifying the polypeptide from the fusion 
protein. A discussion of vectors that contain fusion proteins is provided in Kroll, D.J. et al (1993; DNA 
Cell Biol. 12:441-453). 

Assays

Another aspect of this invention includes assays that may be carried out using a polypeptide or nucleic 
acid molecule according to the invention. Such assays may be for many uses including the development 
of drug candidates, for diagnostic purposes or for the gathering of information for therapeutics. 

If the polypeptide is to be expressed for use in screening assays, generally it is preferred that it be 
produced at the surface of the host cell in which it is expressed. In this event, the host cells may be
harvested prior to use in the screening assay, for example using techniques such as fluorescence activated
cell sorting (FACS) or immunoaffinity techniques. If the polypeptide is secreted into the medium, the
medium can be recovered in order to recover and purify the expressed polypeptide. If the polypeptide is
produced intracellularly, the cells must first be lysed before the polypeptide is recovered.

The polypeptide of the invention can be used to screen libraries of compounds in any of a variety of drug 
screening techniques. Such compounds may activate (agonise) or inhibit (antagonise) the level of 
expression of the gene or the activity of the polypeptide of the invention and form a further aspect of the
present invention. Examples of suitable compounds are those which are effective to alter the expression
of a natural gene which encodes a polypeptide of the invention or to regulate the activity of a polypeptide
of the invention.

Agonist or antagonist compounds may be isolated from, for example, cells, cell-free preparations, 
chemical libraries or natural product mixtures. These agonists or antagonists may be natural or modified
substrates, ligands, enzymes, receptors or structural or functional mimetics. For a suitable review of such
screening techniques, see Coligan et al., Current Protocols in Immunology 1(2): Chapter 5 (1991).

Potential agonists or antagonists include small organic molecules, peptides, polypeptides and antibodies 
that bind to the polypeptide of the invention and thereby modulate its activity. In this fashion, binding of 
the polypeptide to normal cellular binding molecules may be potentiated or inhibited, such that the
normal biological activity of the polypeptide is enhanced or prevented. 

The polypeptide of the invention that is employed in such a screening technique may be free in solution, 
affixed to a solid support, borne on a cell surface or located intracellularly. In general, such screening
procedures may involve using appropriate cells or cell membranes that express the polypeptide that are
contacted with a test compound to observe binding, or stimulation or inhibition of a functional response.
The functional response of the cells contacted with the test compound is then compared with control cells
that were not contacted with the test compound. Such an assay may assess whether the test compound
results in a signal generated by activation of the polypeptide, using an appropriate detection system.
Inhibitors of activation are generally assayed in the presence of a known agonist and the effect on
activation by the agonist in the presence of the test compound is observed.

Alternatively, simple binding assays may be used, in which the adherence of a test compound to a surface 
bearing the polypeptide is detected by means of a label directly or indirectly associated with the test 
compound or in an assay involving competition with a labelled competitor. In another embodiment, 
competitive drug screening assays may be used, in which neutralising antibodies that are capable of 
binding the polypeptide specifically compete with a test compound for binding. In this manner, the
antibodies can be used to detect the presence of any test compound that possesses specific binding affinity
for the polypeptide.

Assays may also be designed to detect the effect of added test compounds on the production of mRNA
encoding the polypeptide in cells. For example, an ELISA may be constructed that measures secreted or
cell-associated levels of polypeptide using monoclonal or polyclonal antibodies by standard methods
known in the art, and this can be used to search for compounds that may inhibit or enhance the production
of the polypeptide from suitably manipulated cells or tissues. The formation of binding complexes
between the polypeptide and the compound being tested may then be measured.

Another technique for drug screening which may be used provides for high throughput screening of 
compounds having suitable binding affinity to the polypeptide of interest (see International patent 
application WO 84/03564). In this method, large numbers of different small test compounds are
synthesised on a solid substrate, which may then be reacted with the polypeptide of the invention and
washed. One way of immobilising the polypeptide is to use non-neutralising antibodies. Bound
polypeptide may then be detected using methods that are well known in the art. Purified polypeptide can
also be coated directly onto plates for use in the aforementioned drug screening techniques. 

A polypeptide according to the invention may be used to identify membrane-bound or soluble receptors, 
through standard receptor binding techniques that are known in the art, such as ligand binding and
crosslinking assays in which the polypeptide is labelled with a radioactive isotope, is chemically 
modified, or is fused to a peptide sequence that facilitates its detection or purification, and incubated with 
a source of the putative receptor (for example, a composition of cells, cell membranes, cell supernatants, 
tissue extracts, or bodily fluids). The efficacy of binding may be measured using biophysical techniques 
such as surface plasmon resonance and spectroscopy. Binding assays may be used for the purification and
cloning of the receptor, but may also identify agonists and antagonists of the polypeptide that compete
with the binding of the polypeptide to its receptor. Standard methods for conducting screening assays are 
well understood in the art. 

A typical polypeptide-based assay might involve contacting the appropriate cell(s) or cell membrane(s) 
expressing the polypeptide with a test compound. In such assays, a polypeptide according to the invention
may be free in solution, affixed to a solid support, borne on a cell surface or located intracellularly. Any 
response to the test compound, for example a binding response, a stimulation or inhibition of a functional 
response may then be compared with a control where the cell(s) or cell membrane(s) was/were not 
contacted with the test compound. 
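
The comparison with an untreated control described above might be expressed numerically as follows. This Python sketch is illustrative only: the function names, the use of replicate means and the 20% threshold are assumptions made for the example and are not specified in this document. It expresses the response of treated cells as a percentage change relative to control cells and labels the result as stimulation, inhibition or no clear effect.

```python
# Illustrative sketch only: names and the default threshold are assumptions.

def percent_change_vs_control(treated, control):
    """Percentage change of the mean treated signal relative to the mean control."""
    mean_treated = sum(treated) / len(treated)
    mean_control = sum(control) / len(control)
    if mean_control == 0:
        raise ValueError("control signal must be non-zero")
    return 100.0 * (mean_treated - mean_control) / mean_control


def classify_response(treated, control, threshold_pct=20.0):
    """Label a functional response relative to untreated control cells.

    threshold_pct is an arbitrary cut-off chosen for illustration; a real
    assay would set it from the assay's own variability.
    """
    change = percent_change_vs_control(treated, control)
    if change >= threshold_pct:
        return "stimulation"
    if change <= -threshold_pct:
        return "inhibition"
    return "no clear effect"
```

A statistical test on replicate measurements would normally replace the fixed threshold used here.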

A binding response could be measured by testing for the adherence of a test compound to a surface
bearing a polypeptide according to the invention. The test compound may aid polypeptide detection by 
being labelled, either directly or indirectly. Alternatively, the polypeptide itself may be labelled, for 
example, with a radioisotope, by chemical modification or as a fusion with a peptide or polypeptide
sequence that will facilitate polypeptide detection. Alternatively, a binding response may be measured, 
for example, by performing a competition assay with a labelled competitor or vice versa. One example of 
such a technique is a competitive drug screening assay, where neutralising antibodies that are capable of 
specifically binding to the polypeptide compete with a test compound for binding. In this manner, the 
antibodies may be used to detect the presence of any test compound that possesses specific binding
affinity for the polypeptide. Alternative binding assay methods are well known in the art and include, but 
are not limited to, cross-linking assays and filter binding assays. The efficacy of binding may be 
measured using biophysical techniques including surface plasmon resonance and spectroscopy. 

High throughput screening is a type of assay which enables a large number of compounds to be searched 
for any significant binding activity to the polypeptide of interest (see patent application WO 84/03564).
This is particularly useful in drug screening. In this scenario, many different small test compounds are 
synthesised on to a solid substrate. The polypeptide is then introduced to this substrate and the whole 
apparatus washed. The polypeptide is then immobilised by, for example, using non-neutralising 
antibodies. Bound polypeptide may then be detected using methods that are well known in the art. 
Purified polypeptide may also be coated directly onto plates for use in the aforementioned drug screening
techniques. 

Assay methods that are also included within the terms of the present invention are those that involve the 
use of the genes and polypeptides of the invention in overexpression or ablation assays. Such assays 
involve the manipulation of levels of these genes/polypeptides in cells and assessment of the impact of 
this manipulation event on the physiology of the manipulated cells. For example, such experiments reveal
details of signaling and metabolic pathways in which the particular genes/polypeptides are implicated, 
generate information regarding the identities of polypeptides with which the studied polypeptides interact 
and provide clues as to methods by which related genes and proteins are regulated. 

Another aspect of this invention provides for any screening kits that are based on or developed from any of
the aforementioned assays.

C. Pharmaceuticals 

A further aspect of the invention provides a pharmaceutical composition suitable for modulating hypoxia 
and/or ischaemia, comprising a therapeutically-effective amount of a polypeptide, a nucleic acid 
molecule, vector or ligand as described above, in conjunction with a pharmaceutically-acceptable carrier. 
A composition containing a polypeptide, nucleic acid molecule, ligand or any other compound of the 
present invention (herein known as X) is considered to be "substantially free of impurities" (herein known 
as Y) when X makes up more than 85% mass per mass of the total [X+Y] mass. Preferably X comprises 
at least 90% of the total X+Y mass. More preferably X comprises at least 95% or 98%, and most preferably 
99%, of the total X+Y mass. 
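
By way of illustration only, the mass-fraction thresholds above can be written as a short calculation. The following Python sketch is not part of the disclosed methods: the function names `purity_fraction` and `purity_tier` and the tier labels are hypothetical, and only the percentage thresholds (more than 85%, at least 90%, 95% and 99%) are taken from the text.

```python
def purity_fraction(mass_x: float, mass_y: float) -> float:
    """Return the mass fraction X / (X + Y) for compound X and impurities Y."""
    if mass_x < 0 or mass_y < 0:
        raise ValueError("masses must be non-negative")
    total = mass_x + mass_y
    if total == 0:
        raise ValueError("total mass must be greater than zero")
    return mass_x / total


def purity_tier(mass_x: float, mass_y: float) -> str:
    """Map a composition onto the purity tiers described in the text.

    The labels are illustrative; the 98% level mentioned in the text
    falls inside the "more preferred" tier used here.
    """
    fraction = purity_fraction(mass_x, mass_y)
    if fraction >= 0.99:
        return "most preferred"
    if fraction >= 0.95:
        return "more preferred"
    if fraction >= 0.90:
        return "preferred"
    if fraction > 0.85:  # strictly more than 85% mass per mass
        return "substantially free of impurities"
    return "not substantially free of impurities"
```

Both masses must be given in the same unit, since only their ratio enters the calculation.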

Carriers 

Carrier molecules may be genes, polypeptides, antibodies, liposomes or indeed any other agent provided 
that the carrier does not itself induce toxic effects or cause the production of antibodies that are harmful 
to the individual receiving the pharmaceutical composition. Further examples of known carriers include 
polysaccharides, polylactic acids, polyglycolic acids and inactive virus particles. Carriers may also 
include pharmaceutically acceptable salts such as mineral acid salts (for example, hydrochlorides, 
hydrobromides, phosphates, sulphates) or the salts of organic acids (for example, acetates, propionates, 
malonates, benzoates). Pharmaceutically acceptable carriers may additionally contain liquids such as 
water, saline, glycerol, ethanol or auxiliary substances such as wetting or emulsifying agents, pH 
buffering substances and the like. Carriers may enable the pharmaceutical compositions to be formulated 
into tablets, pills, dragees, capsules, liquids, gels, syrups, slurries or suspensions to aid intake by the patient. 
A thorough discussion of pharmaceutically acceptable carriers is available in Remington's Pharmaceutical 
Sciences (Mack Pub. Co., N.J. 1991). 

Dosage 

Component X should be present in the composition in a therapeutically effective amount. The 
phrase "therapeutically effective amount" as used herein refers to the amount of agent needed to treat, 
ameliorate, or prevent (for example, when used as a vaccine) a targeted disease or condition. An effective 
initial method to determine a "therapeutically effective amount" may be by carrying out cell culture 
assays (for example, using neoplastic cells) or using animal models (for example, mice, rabbits, dogs or 
pigs). In addition to determining the appropriate concentration range for X to be therapeutically effective, 
animal models may also yield other relevant information such as preferable routes of administration that 
will give maximum effectiveness. Such information may be useful as a basis for patient administration. A 
"patient" as used herein refers to the subject who is receiving treatment by administration of X. 
Preferably, the patient is human, but the term may also include animals. 

The therapeutically-effective dosage will generally be dependent on the patient's status at the time of 
administration. Factors that may be taken into consideration when determining dosage include the severity 
of the disease state in the patient, the general health of the patient, the age, weight, gender, diet, time and 
frequency of administration, drug combinations, reaction sensitivities and the patient's tolerance or 
response to the therapy. The precise amount can be determined by routine experimentation but may 
ultimately lie with the judgement of the clinician. Generally, an effective dose will be from 0.01 mg/kg 
(mass of drug compared to mass of patient) to 50 mg/kg, preferably 0.05 mg/kg to 10 mg/kg. 
Compositions may be administered individually to a patient or may be administered in combination with 
other agents, drugs or hormones. 
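
As a purely arithmetical illustration of the mg/kg ranges stated above, the following Python sketch converts a range expressed per kilogram of patient mass into absolute bounds in milligrams. The function name `dose_bounds_mg` and the constant names are hypothetical; the numerical ranges are those given in the text, and the sketch does not model any of the patient-specific factors listed above.

```python
# Ranges quoted in the text, in mg of drug per kg of patient mass.
GENERAL_RANGE_MG_PER_KG = (0.01, 50.0)
PREFERRED_RANGE_MG_PER_KG = (0.05, 10.0)


def dose_bounds_mg(body_mass_kg, range_mg_per_kg=GENERAL_RANGE_MG_PER_KG):
    """Return (lower, upper) bounds in mg for a patient of the given mass."""
    if body_mass_kg <= 0:
        raise ValueError("body mass must be greater than zero")
    low, high = range_mg_per_kg
    return low * body_mass_kg, high * body_mass_kg


if __name__ == "__main__":
    # Example with an assumed patient mass of 70 kg.
    low, high = dose_bounds_mg(70.0, PREFERRED_RANGE_MG_PER_KG)
    print(f"preferred range: {low:.2f} mg to {high:.2f} mg")
```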

Routes of administration 

Uptake of a pharmaceutical composition of the invention by a patient may be initiated by a variety of 
methods including, but not limited to, enteral, intraarterial, intrathecal, intramedullary, intramuscular, 
intranasal, intraperitoneal, intravaginal, intravenous, intraventricular, oral, rectal (for example, in the form 
of suppositories), subcutaneous, sublingual, transcutaneous applications (for example, see WO98/20734) 
or transdermal means. 

Gene guns or hyposprays may also be used to administer the pharmaceutical compositions of the 
invention. Typically, the therapeutic compositions may be prepared as injectables, either as liquid 
solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles prior to 
injection may also be prepared. Direct delivery of the compositions can generally be accomplished by 
injection, subcutaneously, intraperitoneally, intravenously or intramuscularly, or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a lesion. Dosage treatment 
may be a single dose schedule or a multiple dose schedule. 

Inhibition of excessive activity 

If a particular disease state is partially or completely caused by an inappropriate excess in the activity of a 
polypeptide according to the invention, several approaches are available for inhibiting this activity. 

One approach comprises administering to a patient an inhibitor compound (antagonist) along with a 
pharmaceutically acceptable carrier in an amount effective to inhibit the function of the polypeptide, such 
as by blocking the binding of a ligand, substrate, enzyme, receptor, or by inhibiting a second signal, and 
thereby alleviating the abnormal condition. Such an antagonist molecule may, for example, be an 
antibody. Most preferably, such antibodies are chimeric and/or humanised to minimise their 
immunogenicity, as previously described. 

In another approach, soluble forms of the polypeptide that retain binding affinity for the ligand, substrate, 
enzyme or receptor in question may be administered to the patient to compete with the biological activity 
of the endogenous polypeptide. Typically, the polypeptide may be administered in the form of a fragment 
that retains a portion that is relevant for the desired biological activity. 

In an alternative approach, expression of the gene encoding the polypeptide can be inhibited using 
expression blocking techniques, such as by using antisense nucleic acid molecules (as described above), 
either internally generated or separately administered. Modifications of gene expression may be effected 
by designing complementary sequences or antisense molecules (DNA, RNA, or PNA) to the control, 5' or 
regulatory regions (signal sequence, promoters, enhancers and introns) of the gene encoding the 
polypeptide. Similarly, inhibition can be achieved using "triple helix" base-pairing methodology. Triple 
helix pairing is useful because it causes inhibition of the ability of the double helix to open sufficiently for 
the binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances 
using triplex DNA have been described in the literature (Gee, J.E. et al. (1994) In: Huber, B.E. and B.I. 
Carr, Molecular and Immunologic Approaches, Futura Publishing Co., Mt. Kisco, NY). The 
complementary sequence or antisense molecule may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. Such oligonucleotides may be administered or may 
be generated in situ from expression in vivo. 

Gene silencing approaches may also be undertaken to down-regulate endogenous expression of a gene. 
RNA interference (RNAi) (Elbashir, SM et al., Nature 2001, 411, 494-498) is one method of 
sequence-specific post-transcriptional gene silencing that may be employed. Short dsRNA oligonucleotides are 
synthesised in vitro and introduced into a cell. The sequence-specific binding of these dsRNA 
oligonucleotides triggers the degradation of target mRNA, reducing or ablating target protein expression. 

In addition, expression of a polypeptide according to the invention may be prevented by using a ribozyme 
specific to the encoding mRNA sequence for the polypeptide. Ribozymes are catalytically active RNAs 
that can be natural or synthetic (see for example Usman, N. et al., Curr. Opin. Struct. Biol. (1996) 6(4), 
527-33). Synthetic ribozymes can be designed to specifically cleave mRNAs at selected positions thereby 
preventing translation of the mRNAs into functional polypeptide. Ribozymes may be synthesised with a 
natural ribose phosphate backbone and natural bases, as normally found in RNA molecules. Alternatively 
the ribozymes may be synthesised with non-natural backbones, for example, 2'-O-methyl RNA, to 
provide protection from ribonuclease degradation and may contain modified bases. 

Efficacy of the gene silencing approaches described above may be assessed through the measurement of 
polypeptide expression (for example, by Western blotting), and at the RNA level using TaqMan-based 
methodologies. 

RNA molecules may be modified to increase their intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends of 
the molecule or the use of phosphorothioate or 2' O-methyl rather than phosphodiester linkages within 
the backbone of the molecule. This concept is inherent in the production of PNAs and can be extended in 
all of these molecules by the inclusion of non-traditional bases such as inosine, queuosine and wybutosine, as 
well as acetyl-, methyl-, thio- and similarly modified forms of adenine, cytidine, guanine, thymine and 
uridine that are not as easily recognised by endogenous endonucleases. 

Activation of a polypeptide activity 

If a particular disease state is partially or completely due to a lowered level of biological activity from a 
polypeptide according to the invention, various methods may be used to raise that activity. An example of such a method 
includes administering a therapeutically effective amount of a compound that can activate the polypeptide (i.e. an agonist) 
or cause increased expression of the polypeptide concerned. Administration of such a compound may be 
via any of the methods described previously. 

Gene Therapy 

Another aspect of the present invention provides for gene therapy methods involving nucleic acid 
molecules identified herein. Gene therapy may be used to affect the endogenous production of the 
polypeptide of the present invention by relevant cells in a patient. For example, gene therapy can be used 
permanently to treat the inappropriate production of a polypeptide by replacing a defective gene with the 
corrected therapeutic gene. 

Treatment may be effected either in vivo or ex vivo. Ex vivo gene therapy generally involves the isolation 
and purification of the patient's cells, introduction of the therapeutic gene into the cells and finally, the 
introduction of the genetically-altered cells back into the patient. In vivo gene therapy does not require the 
isolation and purification of patient cells prior to the introduction of the therapeutic gene into the patient. 
Instead, the therapeutic gene can be packaged for delivery into the host. Gene delivery vehicles for in vivo 
gene therapy include, but are not limited to, non-viral vehicles such as liposomes, replication-competent 
and replication-deficient viruses (for example, adenovirus as described by Berkner, K.L., in Curr. Top. 
Microbiol. Immunol., 158, 39-66 (1992)) or adeno-associated virus (AAV) vectors as described by 
Muzyczka, N., in Curr. Top. Microbiol. Immunol., 158, 97-129 (1992) and U.S. Patent No. 5,252,479. 
Alternatively, "naked DNA" may be directly injected into the bloodstream or muscle tissue as a form of 
in vivo gene therapy. 

One example of a strategy for gene therapy including a nucleic acid molecule of the present invention 
may be as follows. A nucleic acid molecule encoding a polypeptide of the invention is engineered for 
expression in a replication-defective or replication-competent vector, such as a retroviral vector. This 
expression construct may then be isolated and introduced into a packaging cell transduced with a 
retroviral plasmid vector containing RNA encoding the polypeptide, such that the packaging cell now 
produces infectious viral particles containing the gene of interest. These producer cells may be 
administered to a patient for engineering cells in vivo and expression of the polypeptide in vivo (see 
Chapter 20, Gene Therapy and other Molecular Genetic-based Therapeutic Approaches, (and references 
cited therein) in Human Molecular Genetics (1996), T Strachan and A P Read, BIOS Scientific Publishers 
Ltd). 

Genetic delivery of antibodies that bind to polypeptides according to the invention may also be effected, 
for example, as described in International patent application WO98/55607. 

Vaccines 

A further embodiment of the present invention provides that the polypeptides or nucleic acid molecules 
identified may be used in the development of vaccines. Where the aforementioned polypeptide or nucleic 
acid molecule is a disease-causing agent, vaccine development can involve the raising of antibodies 
against such agents. Where the aforementioned polypeptide or nucleic acid molecule is one that is 
upregulated, vaccine development can involve the raising of antibodies or T cells against such agents (as 
described in WO00/29428). 

Vaccines according to the invention may either be prophylactic (i.e. preventing infection) or therapeutic (i.e. 
treating disease after infection). Such vaccines comprise immunising antigen(s), immunogen(s), 
polypeptide(s), protein(s) or nucleic acid, usually in combination with pharmaceutically-acceptable 
carriers as described above. Additionally, these carriers may function as immunostimulating agents 
("adjuvants"). Furthermore, the antigen or immunogen may be conjugated to a bacterial toxoid, such as a 
toxoid from diphtheria, tetanus, cholera, H. pylori, and other pathogens. 

Vaccination processes may involve the use of heterologous vectors, for example, priming with MVA and boosting with 
DNA. 

Since polypeptides may be broken down in the stomach, vaccines comprising polypeptides are preferably 
administered parenterally (for instance, subcutaneous, intramuscular, intravenous, or intradermal 
injection). Formulations suitable for parenteral administration include aqueous and non-aqueous sterile 
injection solutions that may contain anti-oxidants, buffers, bacteriostats and solutes which render the 
formulation isotonic with the blood of the recipient, and aqueous and non-aqueous sterile suspensions 
which may include suspending agents or thickening agents. 

The vaccine formulations of the invention may be presented in unit-dose or multi-dose containers, for 
example, sealed ampoules and vials, and may be stored in a freeze-dried condition requiring only the 
addition of the sterile liquid carrier immediately prior to use. The dosage will depend on the specific 
activity of the vaccine and can be readily determined by routine experimentation. 

The technology referred to as jet injection (see, for example, www.powderject.com) may also be useful in 
the formulation of vaccine compositions. 

In accordance with this aspect of the present invention, polypeptides can be delivered by viral or non-viral 
techniques. Non-viral delivery systems include but are not limited to DNA transfection methods. Here, 
transfection includes a process using a non-viral vector to deliver an antigen gene to a target mammalian 
cell. Typical transfection methods include electroporation, nucleic acid biolistics, lipid-mediated 
transfection, compacted nucleic acid-mediated transfection, liposomes, immunoliposomes, lipofectin, 
cationic agent-mediated transfection, cationic facial amphiphiles (CFAs) (Nature Biotechnology 1996 14: 556), 
multivalent cations such as spermine, cationic lipids or polylysine, 
1,2-bis(oleoyloxy)-3-(trimethylammonio)propane (DOTAP)-cholesterol complexes (Wolff and Trubetskoy 1998 Nature 
Biotechnology 16: 421) and combinations thereof. 

Viral delivery systems include but are not limited to adenovirus vectors, adeno-associated viral (AAV) 
vectors, herpes viral vectors, influenza, retroviral vectors, lentiviral vectors or baculoviral vectors, 
Venezuelan equine encephalitis virus (VEE), poxviruses such as: canarypox virus (Taylor et al 1995 
Vaccine 13:539-549), entomopox virus (Li Y et al 1998 XIIth International Poxvirus Symposium p 144. 
Abstract), penguin pox (Standard et al. J Gen Virol. 1998 79:1637-46), alphavirus, and alphavirus based 
DNA vectors. 

In addition to the use of polypeptide-based vaccines, this aspect of the invention includes the use of 
genetically-based vaccines, for example, those vaccines that are effective through eliciting the expression 
of a particular gene (either endogenous or exogenously derived) in a cell, so targeting this cell for 
destruction by the immune system of the host organism. 

A number of suitable methods for vaccination and vaccine delivery systems are described in International 
patent application WO00/29428. 

D. Diagnostics 

Another aspect of the present invention provides for the use of a nucleic acid molecule identified herein 
as a diagnostic reagent. 

For example, a nucleic acid molecule may be detected or isolated from a patient's tissue and used for 
diagnostic purposes. "Tissue" as defined herein refers to blood, urine, any matter obtained from a tissue 
biopsy or any matter obtained from an autopsy. Genomic DNA from the tissue sample may be used 
directly for detection of a hypoxia-related condition. Alternatively, the DNA may be amplified using 
methods such as polymerase chain reaction (PCR), the ligase chain reaction (LCR), strand displacement 
amplification (SDA), or other amplification techniques (see Saiki et al., Nature, 324, 163-166 (1986); 
Bej, et al., Crit. Rev. Biochem. Molec. Biol., 26, 301-334 (1991); Birkenmeyer et al., J. Virol. Meth., 35, 
117-126 (1991) and Brunt, J., Bio/Technology, 8, 291-294 (1990)). Such diagnostics are particularly 
useful for prenatal and even neonatal testing. 

A method of diagnosis of disease using a polynucleotide may comprise assessing the level of expression 
of the natural gene and comparing the level of encoded polypeptide to a control level measured in a 
normal subject that does not suffer from the disease or physiological condition that is being tested. The 
diagnosis may comprise the following steps: 

a) contacting a sample of tissue from the patient with a nucleic acid probe under stringent conditions 
that allow the formation of a hybrid complex between a nucleic acid molecule of the invention and 
the probe; 

b) contacting a control sample with said probe under the same conditions used in step a); and 

c) detecting the presence of hybrid complexes in said samples; 

wherein detection of differing levels of the hybrid complex in the patient sample compared to levels of 
the hybrid complex in the control sample is indicative of the dysfunction. 

A further aspect of the invention comprises a diagnostic method comprising the steps of: 

a) obtaining a tissue sample from a patient being tested for disease; 

b) isolating a nucleic acid molecule according to the invention from said tissue sample; and 

c) diagnosing the patient for disease by detecting the presence of a mutation in the nucleic acid 
molecule which is associated with disease. 

To aid the detection of nucleic acid molecules in the above-described methods, an amplification step, 
such as PCR, may be included. An example of this includes detection of deletions or insertions indicative 
of the dysfunction by a change in the size of the amplified product in comparison to the normal genotype. 
Point mutations can be identified by hybridising amplified DNA to labelled RNA of the invention or 
alternatively, labelled antisense DNA sequences of the invention. Perfectly matched sequences can be 
distinguished from mismatched duplexes by RNase digestion or by assessing differences in melting 
temperatures. The presence or absence of the mutation in the patient may be detected by contacting DNA 
with a nucleic acid probe that hybridises to the DNA under stringent conditions to form a hybrid 
double-stranded molecule, the hybrid double-stranded molecule having an unhybridised portion of the nucleic 
acid probe strand at any portion corresponding to a mutation associated with disease; and detecting the 
presence or absence of an unhybridised portion of the probe strand as an indication of the presence or 
absence of a disease-associated mutation in the corresponding portion of the DNA strand. 
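
The size comparison described above for amplified products can be sketched in a few lines of Python. This is an illustrative sketch only: the function name `classify_amplicon` and the `tolerance_bp` parameter (an allowance for sizing error) are assumptions introduced here and do not appear in the specification. A size comparison of this kind cannot reveal point mutations, which leave the product length unchanged.

```python
def classify_amplicon(observed_bp: int, reference_bp: int, tolerance_bp: int = 0) -> str:
    """Compare an amplified product size with the size expected for the normal genotype.

    observed_bp  -- measured size of the patient's amplified product, in base pairs
    reference_bp -- size of the product amplified from the normal genotype
    tolerance_bp -- hypothetical allowance for sizing error
    """
    if observed_bp <= 0 or reference_bp <= 0 or tolerance_bp < 0:
        raise ValueError("sizes must be positive and tolerance non-negative")
    difference = observed_bp - reference_bp
    if abs(difference) <= tolerance_bp:
        return "no size change"
    return "insertion" if difference > 0 else "deletion"
```

For example, with a reference product of 500 bp, an observed product of 470 bp would be reported as a deletion and one of 512 bp as an insertion.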

Point mutations and other sequence differences between the reference gene and "mutant" genes can be 
identified by other well-known techniques, such as direct DNA sequencing or single-strand 
conformational polymorphism (see Orita et al., Genomics, 5, 874-879 (1989)). For example, a 
sequencing primer may be used with double-stranded PCR product or a single-stranded template 
molecule generated by a modified PCR. The sequence determination is performed by conventional 
procedures with radiolabeled nucleotides or by automatic sequencing procedures with fluorescent tags. 
Cloned DNA segments may also be used as probes to detect specific DNA segments. The sensitivity of 
this method is greatly enhanced when combined with PCR. Further, point mutations and other sequence 
variations, such as polymorphisms, can be detected as described above, for example, through the use of 
allele-specific oligonucleotides for PCR amplification of sequences that differ by single nucleotides. 

DNA sequence differences may also be detected by alterations in the electrophoretic mobility of DNA 
fragments in gels, with or without denaturing agents, or by direct DNA sequencing (for example, Myers 
et al., Science (1985) 230:1242). Sequence changes at specific locations may also be revealed by nuclease 
protection assays, such as RNase and S1 protection or the chemical cleavage method (see Cotton et al., 
PNAS USA (1985) 85:4397-4401). 

In addition to conventional gel electrophoresis and DNA sequencing, mutations such as microdeletions, 
aneuploidies, translocations and inversions can also be detected by in situ analysis (see, for example, Keller 
et al., DNA Probes, 2nd Ed., Stockton Press, New York, N.Y., USA (1993)), that is, DNA or RNA 
sequences in cells can be analysed for mutations without need for their isolation and/or immobilisation 
onto a membrane. Fluorescence in situ hybridisation (FISH) is presently the most commonly applied method and numerous reviews of FISH 
have appeared (see, for example, Trachuck et al., Science, 250, 559-562 (1990), and Trask et al., Trends 
Genet., 7, 149-154 (1991)). 

Arrays 

In another embodiment of the invention, an array of oligonucleotide probes comprising a nucleic acid 
molecule according to the invention can be constructed to conduct efficient screening of genetic variants, 
mutations and polymorphisms. Array technology methods are well known and have general applicability 
and can be used to address a variety of questions in molecular genetics including gene expression, genetic 
linkage, and genetic variability (see for example: M.Chee et al, Science (1996), Vol 274, pp 610-613). 

In one embodiment, the array is prepared and used according to the methods described in WO95/11995 
(Chee et al.); Lockhart, D. J. et al. (1996) Nat. Biotech. 14: 1675-1680; and Schena, M. et al. (1996) 
PNAS 93: 10614-10619. Oligonucleotide pairs may range from two to over one million. The oligomers 
are synthesized at designated areas on a substrate using a light-directed chemical process. The substrate 
may be paper, nylon or other type of membrane, filter, chip, glass slide or any other suitable solid support. 
In another aspect, an oligonucleotide may be synthesized on the surface of the substrate by using a 
chemical coupling procedure and an ink jet application apparatus, as described in PCT application 
WO95/251116 (Baldeschweiler et al.). In another aspect, a "gridded" array analogous to a dot (or slot) blot 
may be used to arrange and link cDNA fragments or oligonucleotides to the surface of a substrate using a 
vacuum system, thermal, UV, mechanical or chemical bonding procedures. An array, such as those 
described above, may be produced by hand or by using available devices (slot blot or dot blot apparatus), 
materials (any suitable solid support), and machines (including robotic instruments), and may contain 8, 
24, 96, 384, 1536 or 6144 oligonucleotides, or any other number between two and over one million which 
lends itself to the efficient use of commercially-available instrumentation. 

Diagnostics using polypeptides or mRNA 

In addition to the methods discussed above, diseases may be diagnosed by methods comprising 
determining, from a sample derived from a subject, an abnormally decreased or increased level of 
polypeptide or mRNA. Decreased or increased expression can be measured at the RNA level using any of 
the methods well known in the art for the quantitation of polynucleotides, such as, for example, nucleic 
acid amplification, for instance PCR, RT-PCR, RNase protection, Northern blotting and other 
hybridization methods. 

Assay techniques that can be used to determine levels of a polypeptide of the present invention in a 
sample derived from a host are well-known to those of skill in the art and are discussed in some detail 
above (including radioimmunoassays, competitive-binding assays, Western Blot analysis and ELISA 
assays). One example of this aspect of the invention provides a diagnostic method which comprises the 
steps of: (a) contacting a ligand as described above with a biological sample under conditions suitable for 
the formation of a ligand-polypeptide complex; and (b) detecting said complex. 

Protocols such as ELISA, RIA, and FACS for measuring polypeptide levels may additionally provide a 
basis for diagnosing altered or abnormal levels of polypeptide expression. Normal or standard values for 
polypeptide expression are established by combining body fluids or cell extracts taken from normal 
mammalian subjects, preferably humans, with antibody to the polypeptide under conditions suitable for 
complex formation. The amount of standard complex formation may be quantified by various methods, 
such as by photometric means. 

Antibodies which specifically bind to a polypeptide of the invention may be used for the diagnosis of 
conditions or diseases characterised by expression of the polypeptide, or in assays to monitor patients 
being treated with the polypeptides, nucleic acid molecules, ligands and other compounds of the 
invention. Antibodies useful for diagnostic purposes may be prepared in the same manner as those 
described above for therapeutics. Diagnostic assays for the polypeptide include methods that utilise the 
antibody and a label to detect the polypeptide in human body fluids or extracts of cells or tissues. The 
antibodies may be used with or without modification, and may be labelled by joining them, either 
covalently or non-covalently, with a reporter molecule. A wide variety of reporter molecules known in the 
art may be used, several of which are described above. 

Quantities of polypeptide expressed in subject, control and disease samples from biopsied tissues are 
compared with the standard values. Deviation between standard and subject values establishes the 
parameters for diagnosing disease. Diagnostic assays may be used to distinguish between absence, 
presence, and excess expression of polypeptide and to monitor regulation of polypeptide levels during 
therapeutic intervention. Such assays may also be used to evaluate the efficacy of a particular therapeutic 
treatment regimen in animal studies, in clinical trials or in monitoring the treatment of an individual 
patient. 
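
One possible way of expressing the comparison between subject values and standard values is sketched below in Python. The specification does not prescribe a statistical method; the use of a mean and sample standard deviation of levels from normal subjects, the cutoff of two standard deviations, and the function name `classify_level` are all assumptions made for illustration.

```python
from statistics import mean, stdev


def classify_level(subject_level, normal_levels, z_cutoff=2.0):
    """Classify a subject's polypeptide level against levels from normal subjects.

    normal_levels -- measurements from normal subjects (at least two values)
    z_cutoff      -- assumed number of standard deviations treated as abnormal
    """
    if len(normal_levels) < 2:
        raise ValueError("at least two normal measurements are required")
    standard = mean(normal_levels)   # the "standard value"
    spread = stdev(normal_levels)    # sample standard deviation
    if spread == 0:
        raise ValueError("normal measurements show no variation")
    z_score = (subject_level - standard) / spread
    if z_score > z_cutoff:
        return "increased"
    if z_score < -z_cutoff:
        return "decreased"
    return "within standard range"
```

The same function could be applied at successive time points to follow polypeptide levels during therapeutic intervention.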

Diagnostic kits 

A diagnostic kit of the present invention may comprise: 

(a) a nucleic acid molecule of the present invention; 

(b) a polypeptide of the present invention; or 

(c) a ligand of the present invention. 

In one aspect of the invention, a diagnostic kit may comprise a first container containing a nucleic acid 
probe that hybridises under stringent conditions with a nucleic acid molecule according to the invention; a 
second container containing primers useful for amplifying the nucleic acid molecule; and instructions for 
using the probe and primers for facilitating the diagnosis of disease. The kit may further comprise a third 
container holding an agent for digesting unhybridised RNA. 

In an alternative aspect of the invention, a diagnostic kit may comprise an array of nucleic acid molecules, 
an array of antibody molecules, and/or an array of polypeptide molecules, as discussed in more detail 
above. 

Such kits will be of use in diagnosing a disease or susceptibility to disease, particularly inflammation, 
oncology, or cardiovascular disease. 

Various aspects and embodiments of the present invention will now be described in more detail by way of 
example, with particular reference to polypeptides regulated differentially under hypoxic conditions as 
opposed to normoxic conditions. It will be appreciated that modification of detail may be made without 
departing from the scope of the invention. 

Brief description of the Figures 

Figure 1 shows a scatter plot, showing normalised signal intensities in hypoxia versus normoxia, with 
each dot representing a single gene. 

Figure 2: Hypoxia responses amplified by HIF1alpha overexpression. Data shown are the average of 4 
repeat experiments. Values represent fold change as compared to untreated cells (condition 1). Error bars 
represent standard error of the mean. 

Figure 3: Hypoxia responses amplified by EPAS1 overexpression. Data shown are the average of 4 repeat 
experiments. Values represent fold change as compared to untreated cells (condition 1). Error bars 
represent standard error of the mean. 

Figure 4: Hypoxia responses amplified by HIF1alpha / EPAS1 overexpression. Data shown are the average 
of 4 repeat experiments. Values represent fold change as compared to untreated cells (condition 1). Error 
bars represent standard error of the mean. 

Figure 5 shows genes that are induced by hypoxia to a greater degree in resting macrophages, as 
compared to activated macrophages. Error bars show the standard deviation from both repeat experiments 
and multiple exposures from single experiments. These data are not shown in table form. All bars are 
ratios of mRNA expression in hypoxia/normoxia. These are calculated separately for resting (light bars) 
and activated (dark bars) macrophages, and do not illustrate differences resulting from activation in 
normoxia. 

Figure 6 shows genes which are induced by hypoxia to a greater degree in activated macrophages, 
compared to resting macrophages. 

Figure 7 shows genes that are repressed by hypoxia to a greater degree in activated macrophages. 

For Figures 8, 9a, 9c, 10-32a, 32d and 33-52, mRNA levels, determined from a custom gene array, of 
particular genes are shown on the Y-axis, expressed as a value as compared to the median expression 
level of this gene throughout all samples. Eleven primary human cell types as shown on the x-axis were 
cultured in normoxia (black), or exposed to hypoxia for 6 hr (grey) or 18 hr (white).

Figure 8: Ecotropic viral integration site 2A (Seq ID:475/476).

Figure 9a: Novel PI-3-kinase adapter (Seq ID:79/80); Image clone accession R62339. 

Figure 9b: TaqMan Real-time Q-RT-PCR data for Novel PI-3-kinase adapter (Seq ID:79/80); Image 
clone accession R62339. 

Figure 9c: IMAGE clone acc R59598 (Syk).

Figure 10: Regulator of G-protein signalling 1 (Seq ID:375/376)

Figure 11: GM2 ganglioside activator protein (Seq ID:389/390) 

Figure 12: Hypothetical protein PRO0823 (Seq ID:21/22) 

Figure 13: CYP1 (cytochrome P450, subfamily XXVIIB) (Seq ID:339/340) 

Figure 14: Alpha-2-macroglobulin (Seq ID:405/406)

Figure 15: Interleukin 1 receptor antagonist (Seq ID:357/358)



Figure 16: SCYA3L (Seq ID:469/470) 

Figure 17: CFFM4 (Seq ID:433/434) 

Figure 18: Pleckstrin (Seq ID:431/432) 

Figure 19: CYP1B1 (SeqID:325/326)

Figure 20: CYP1B1 (SeqID:137/138)

Figure 21: Hypothetical protein FLJ13511 (SeqID:163/164)

Figure 22: Hematopoietic Zinc finger protein (SeqID:17/18)

Figure 23: Osteopontin (SeqID:267/268)

Figure 24: Osteopontin (SeqID:267/268)

Figure 25: Adipophilin (SeqID:313/314)

Figure 26: Adipophilin (SeqID:313/314)

Figure 27: Adipophilin (SeqID:313/314)

Figure 28: Adipophilin (SeqID:313/314)

Figure 29: Hypothetical protein FLJ22690 (SeqID:205/206)

Figure 30: cDNA DKFZp586E1624 (SeqID:65/66)

Figure 31: EST (SeqID:197/198)

Figure 32a: EGL nine (C.elegans) homolog 3 (SeqID:85/86) 

Figure 32b: Gene expression profiles in macrophages with and without activation. mRNA levels,
determined from a custom gene array, of C1orf12 are shown on the Y-axis, expressed as a value compared
to the mean value of a set of control genes on each array (per-chip normalisation). All cells were human
macrophages, cultured either without cytokines or with IL-10 or with the combination of IFN-gamma and LPS
in normoxia and hypoxia.

Figure 32c: Gene expression profiles in macrophages with and without activation. mRNA levels,
determined from a custom gene array, of EGLN3 are shown on the Y-axis, expressed as a value compared
to the mean value of a set of control genes on each array (per-chip normalisation). All cells were human
macrophages, cultured either without cytokines or with IL-10 or with the combination of IFN-gamma and LPS
in normoxia and hypoxia.

Figure 32d: C1orf12 (SeqID:89/90)

Figure 32e: The effect of EPAS/ HIF overexpression on expression of the C1orf12 EGLN gene
using a custom gene array. mRNA expression levels of the gene C1orf12 as determined by the custom
array, in response to hypoxia and adenoviral over-expression of HIF or EPAS are shown. Experimental
conditions are as follows: #1 no adeno / normoxia; #2 empty adeno (low dose)/ normoxia; #3 empty
adeno (high dose)/ normoxia; #4 empty adeno (low dose)/ hypoxia; #5 empty adeno (high dose)/ hypoxia;
#6 HIF-1 adeno (low dose)/ hypoxia; #7 HIF-1 adeno (high dose)/ hypoxia; #8 EPAS adeno (low dose)/
hypoxia; #9 EPAS adeno (high dose)/ hypoxia. Error bars are the standard error of the mean.

Figure 32f: The effect of EPAS/ HIF overexpression on expression of the EGLN3 gene using a
custom gene array. mRNA expression levels of the gene EGLN3 as determined by the custom array, in
response to hypoxia and adenoviral over-expression of HIF or EPAS are shown. Experimental conditions
are as follows: #1 no adeno / normoxia; #2 empty adeno (low dose)/ normoxia; #3 empty adeno (high
dose)/ normoxia; #4 empty adeno (low dose)/ hypoxia; #5 empty adeno (high dose)/ hypoxia; #6 HIF-1
adeno (low dose)/ hypoxia; #7 HIF-1 adeno (high dose)/ hypoxia; #8 EPAS adeno (low dose)/ hypoxia;
#9 EPAS adeno (high dose)/ hypoxia. Error bars are the standard error of the mean.

Figure 32g: The effect of EPAS/ HIF overexpression on expression of the EGLN3 gene using Affymetrix
Hu95 ver2 GeneChips. mRNA expression levels of the gene in response to hypoxia and adenoviral
over-expression of HIF or EPAS are shown. Graphs show the mean of two replicate arrays, with error bars as
standard deviation. Above each graph, data values are shown, including the normalised values and raw
values (the Affymetrix average difference parameter) and Present/ Absent flags.

Figure 32h: The effect of EPAS/ HIF overexpression on expression of the C1orf12 gene using Affymetrix
Hu95 ver2 GeneChips. mRNA expression levels of the gene in response to hypoxia and adenoviral
over-expression of HIF or EPAS are shown. Graphs show the mean of two replicate arrays, with error bars as
standard deviation. Above each graph, data values are shown, including the normalised values and raw
values (the Affymetrix average difference parameter) and Present/ Absent flags.

Figure 32i: Flag immunocytochemistry in HEK293T cells

Figure 32j: Human Cardiomyocyte Caspase Activity after 72 hours transduction with EIAV-ELG9- 
Homolog 3 

Figure 33: Novel Metallothionein (SeqID:83/84)

Figure 34: Hypothetical protein hqp0376 (SeqID:337/338)

Figure 35: Metallothionein 2A (SeqID:265/266)

Figure 36: Metallothionein 1G (SeqID:243/244)

Figure 37: Metallothionein 1H (SeqID:239/240)

Figure 38: Hepcidin antimicrobial peptide (SeqID:141/142)

Figure 39: EST (SeqID:117/118)

Figure 40: Hypothetical protein FLJ22622 (SeqID:129/130)

Figure 41: TRIP-Br2 (SeqID:31/32)

Figure 42: Tumor protein D52 (SeqID:301/302)

Figure 43: Semaphorin 4b (SeqID:91/92/92a)

Figure 44: Dec-1 (SeqID:371/372)

Figure 45: Calgranulin A (SeqID:447/448)

Figure 46: ERO1 (S. cerevisiae)-like (SeqID:67/68)

Figure 47: Hypothetical protein FLJ20500 (SeqID:25/26)

Figure 48: N-myc downstream regulated (SeqID:229/230)

Figure 49: Decidual protein induced by progesterone (SeqID:387/388)

Figure 50: Integrin, alpha 5 (SeqID:379/380)

Figure 51: Tissue factor (SeqID:225/226)

Figure 52: COX-2 (SeqID:237/238)

Figure 53: Genes up-regulated by macrophage activation. Normalised mRNA levels in the 6 experimental
conditions (#1 no cytokines/ normoxia, #2 no cytokines/ hypoxia, #3 IL-10/ normoxia, #4 IL-10/ hypoxia,
#5 LPS/IFN/ normoxia, #6 LPS/IFN/ hypoxia) are shown as values referenced to the median value of that
gene throughout all 6 experimental conditions. Error bars show the standard error of the mean.

Figure 54: Genes downregulated by macrophage activation (I)

Figure 55: Genes downregulated by macrophage activation (II) 

Figure 56: Genes downregulated by macrophage activation (III) 

Figure 57 shows an RNase protection assay for the gene encoding Semaphorin 4b. 

Figure 58 shows a Northern blot showing the size of the mRNA and tissue distribution for the
Semaphorin 4b gene.
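
The two normalisations referred to in the figure legends above (values referenced to the median of a gene across all samples, and fold change relative to condition 1 with the standard error of the mean) can be sketched as follows. This is a minimal illustration only, not the analysis actually performed: the function names and example numbers are hypothetical, and it is assumed that fold change is computed per repeat experiment and then averaged.

```python
import math
import statistics


def median_referenced(levels):
    """Express each value relative to the median of the gene across all samples."""
    med = statistics.median(levels)
    return [v / med for v in levels]


def fold_change_with_sem(control, treated):
    """Mean fold change (treated / control) over paired repeats, with its SEM.

    control, treated: per-experiment values for condition 1 and the test condition.
    Assumption: the fold change is taken within each repeat, then averaged.
    """
    folds = [t / c for c, t in zip(control, treated)]
    mean = statistics.mean(folds)
    sem = statistics.stdev(folds) / math.sqrt(len(folds))  # sample SD / sqrt(n)
    return mean, sem


# Hypothetical values, for illustration only
print(median_referenced([2.0, 4.0, 8.0]))
print(fold_change_with_sem([1.0, 1.0, 2.0, 2.0], [2.0, 4.0, 4.0, 8.0]))
```

Taking the ratio of the mean signals instead of the mean of per-repeat ratios would give a different error estimate; which of the two was used is not stated here.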

Examples

Summary 

Subtracted cDNA libraries were prepared separately for hypoxic macrophages and cardiomyoblasts. This
involved harvesting RNA from cells in both normoxia and hypoxia, and preparing cDNA. Subtractive
hybridization / suppression PCR was then performed to remove from the hypoxic cell cDNA those genes
that are also present in cDNA from normoxic cells. Insert DNA from the libraries was PCR amplified
and arrayed onto duplicate membranes. Quantitative hybridizations with pre-library cDNA material
(normoxia and hypoxia) were done to identify clones in the libraries that actually contain
hypoxia-inducible genes. The insert DNA was then sequenced.

This procedure was done independently for macrophages and cardiomyoblasts. The hypoxia-inducible
genes identified from these different cell types differed widely, with only a minority of these genes being
identified from both cell types.

To characterise the differences between the two tissues further, arrays were produced containing all
confirmed hypoxia-inducible genes from the macrophage library. Replicate arrays were hybridised with
cDNA from normoxic and hypoxic cardiomyoblasts to allow quantitative evaluation of these genes in the
cardiomyoblast. This revealed quantitative differences in the hypoxia-induced activation of these genes in
the two cell types.

Example 1a: Comparison of the hypoxic response between human macrophages and
cardiomyoblasts by a subtraction cloning / array screening approach

Methods / Results

To isolate human macrophages, monocytes were derived from peripheral blood of healthy human donors.
100 ml bags of buffy coat from the Bristol Blood Transfusion Centre were mixed with an equal volume of
RPMI1640 medium (Sigma). This was layered on top of 10 ml Ficoll-Paque (Pharmacia) in 50 ml centrifuge
tubes and centrifuged for 25 min at 800 x g. The interphase layer was removed, washed in MACS buffer
(phosphate buffered saline pH 7.2, 0.5% bovine serum albumin, 2 mM EDTA) and resuspended at 80
microliter per 10^7 cells. To this 20 microliter CD14 Microbeads (Miltenyi Biotec) were added, and the
tube incubated at 4 degrees for 15 min. Following this, one wash was performed in MACS buffer at 400 x
g and the cells were resuspended in 3 ml MACS buffer and separated on an LS+ MACS Separation
Column (Miltenyi Biotec) positioned on a midi-MACS magnet (Miltenyi Biotec). The column was
washed with 3 x 3 ml MACS buffer. The column was removed from the magnet and cells were eluted in 5
ml MACS buffer using a syringe. Cells were washed in culture medium (AIM V (Sigma) supplemented
with 2% human AB serum (Sigma)), resuspended at 2 x 10^5 cells per ml in the same medium,
placed in large teflon-coated culture bags (Sud-Laborbedarf GmbH, 82131 Gauting, Germany) and
transferred to a tissue culture incubator (37 degrees, 5% CO2) for 7-10 days. During this period
monocytes spontaneously differentiate to macrophages. This is confirmed by examining cell morphology
using phase contrast microscopy. Cells are removed from the bags by placing at 4 degrees for 30 min and
emptying the contents. The cells are then washed and resuspended in culture medium at 5 x 10^5 cells/ml
and plated out in Primaria 10 cm tissue culture petri dishes (Falcon Becton Dickinson) at 5 x 10^6 cells
per dish. Culture is continued for 16-24 hr to allow cell adherence, prior to experimentation involving
hypoxia.

As an alternative primary cell type, human cardiomyoblast cultures were established. Cells derived from
the ventricular tissue of newborn or foetal hearts were purchased from BioWhittaker (CC-2582). Growth
conditions were used to allow maximum expansion of the cells in vitro, by using a medium rich in growth
factors. Under such conditions cardiomyoblast-like cells (the developmental precursor of
cardiomyocytes) predominate. This has been previously described by Goldman and Wurzel (In Vitro Cell Dev. Biol.
28A: 109-119 (1992)) and Goldman et al. (1996, Exp. Cell Res. 228(2): 237-245).

For these cultures, cells were seeded at 1 x 10^6 per T150 flask in human smooth muscle growth medium
(TCS CellWorks ZHM-3935) and were expanded in the same medium up to a maximum of 4
passages. The growth medium is purchased pre-prepared and includes, in its formula, 5% fetal bovine
serum, insulin, epidermal growth factor and fibroblast growth factor. Prior to experimentation involving
hypoxia, cells were plated onto 10 cm tissue culture petri dishes and allowed to reach confluency.

For experimentation with hypoxia, for all cell types, an equal number of identical culture dishes were
divided between two separate incubators: one at 37 degrees, 5% CO2, 95% air (= normoxia) and the other at
37 degrees, 5% CO2, 94.9% nitrogen, 0.1% oxygen (= hypoxia). After 6 hours of culture under these
conditions, the dishes were removed from the incubator, placed on a chilled platform, washed in cold PBS
and total RNA was extracted using RNazol B (Tel-Test, Inc; distributed by Biogenesis Ltd) following the
manufacturer's instructions. Polyadenylated mRNA was extracted from the total RNA using a
commercial kit following the manufacturer's instructions (Promega; PolyATract mRNA isolation System
IV).

The hypoxia period of 6 hr was previously determined to be sufficient to allow the induction of known
hypoxia-regulated genes, as determined by RNase protection assays. During these preliminary studies it
was noted that macrophages, cardiomyoblasts and an additional control cell type, Jurkat T-cells, showed
different patterns of gene induction in response to hypoxia:

Known hypoxia-inducible gene                  Level of hypoxia-induced increase in mRNA levels
                                              Macrophage    Myoblast    T-cell

phosphoglycerate kinase-1 (PGK)               high          low         high
vascular endothelial growth factor-A (VEGF)   high          low         high
solute carrier family 2, member 1 (Glut-1)    none          none        high


Separate subtracted cDNA populations were generated from mRNA extracted from hypoxic macrophages
and hypoxic cardiomyoblasts, using a combination of two kits purchased from Clontech Laboratories:
the SMART PCR cDNA synthesis kit and the PCR Select cDNA subtraction kit. The manufacturer's instructions
were followed for both kits. All diagnostic steps were followed as recommended by the manufacturers.
All PCR reactions were done using an Applied Biosystems 9700 with 96-well block, using Applied
Biosystems plastics. Driver and tester populations used for subtraction were as below:

Subtracted cDNA              Tester                       Driver

Subtracted macrophage        macrophage (hypoxia)         macrophage (normoxia)
Subtracted cardiomyoblast    cardiomyoblast (hypoxia)     cardiomyoblast (normoxia)

The final subtracted cDNA samples were evaluated by performing RT-PCR using the following primers
for human beta actin: 

sense: TCACCCACACTGTGCCCATCTACGA 
antisense: CAGCGGAACCGCTCATTGCCAAATGG 

This showed that an additional 5 cycles of PCR were required to achieve similar levels of beta actin
product from subtracted compared to unsubtracted cDNA, indicating a significant reduction in the
representation of a non-regulated gene in the subtracted cDNA. The glyceraldehyde 3-phosphate
dehydrogenase PCR primers contained in the kit were not used.
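
As a rough worked example of what a 5-cycle shift implies: if each PCR cycle doubled the product, 5 extra cycles would correspond to about a 2^5 = 32-fold lower starting amount of beta actin template in the subtracted cDNA. The sketch below makes that arithmetic explicit. The function name and the per-cycle efficiency parameter are illustrative assumptions; the text does not report amplification efficiency.

```python
def fold_depletion(extra_cycles, efficiency=1.0):
    """Fold reduction in template implied by needing `extra_cycles` more PCR cycles.

    efficiency=1.0 assumes perfect doubling in every cycle (an idealisation);
    real reactions are usually somewhat less efficient.
    """
    return (1.0 + efficiency) ** extra_cycles


print(fold_depletion(5))       # 32.0 under the perfect-doubling assumption
print(fold_depletion(5, 0.9))  # smaller estimate if each cycle gives 1.9-fold
```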

The three subtracted cDNA populations were ligated into a plasmid vector (pCRII, Invitrogen) to
generate libraries, which were transformed into E. coli (INValphaF', Invitrogen) and plated out onto agar
supplemented with ampicillin and X-Gal, according to standard methods.

Colonies that are white indicate the presence of a recombinant plasmid, and these were picked into 
individual wells of 96-well plates containing 100 microliters LB-Ampicillin, and given 3-8 hr growth at
37 degrees. In this way, for each library, up to 15 x 96-well plates of clones were generated. 
To screen clones for the presence of differentially expressed genes, replicate arrays of plasmid insert
DNA were generated on nylon membranes. Firstly, PCR was performed using nested PCR primers 2R
and 1, which flank the cDNA insert of each clone (sequence described in the PCR Select kit). The
reaction mix also contained 200 uM d(A,T,C,G)TP, Advantage2 polymerase mix (Clontech Laboratories)
and the supplied 10x buffer. 40 ul reactions were set up in 96-well PCR reaction plates and inoculated with
0.5 ul bacteria from the library plates. 23 cycles of PCR were performed (95 degrees 10 sec; 68 degrees 2
min), and a selection of wells were checked on an agarose gel. In this manner a 96-well plate of insert
DNA was generated for each 96-well plate of bacterial clones. Arrays of insert DNA were generated by
transferring 4 ul of each well to 384-well plates (Genetix), and denaturing the DNA by adding 4 ul 0.4 M
NaOH and incubating at 37 degrees for 15 minutes. Bromophenol blue was added to the wells to allow
visualisation of arraying. A 384-pin replicator (Genetix) was used to spot small volumes of denatured
insert DNA onto dry nylon membranes (Hybond N+, Amersham Pharmacia).

By repeating this operation from the same 384-well plate onto several membranes, matched pairs of
membranes were produced, suitable for array screening. A fragment of the beta actin gene was spotted at
specific positions of the arrays. Following spotting, the membranes were left at room temperature for 2 hr,
re-denatured by placing on chromatography paper wetted with 0.3 M NaOH, neutralised by placing on
chromatography paper wetted with 0.5 M Tris pH 7.5, dried at room temperature for 2 hr and crosslinked
by exposing to 2000 joules UV radiation. Prior to hybridisation, residual salts were removed from the
arrays by washing in hot 0.5% SDS.

Matched pairs of membranes were hybridised with subtracted cDNA samples from hypoxic and
normoxic cells, to determine the abundance of the genes corresponding to each spotted clone in the cDNA
samples. Because the cDNA probes were subtracted, large differences in the hybridisation signal for
individual spots were apparent and could be identified by eye. Prior to probe labelling, subtracted cDNA
samples were digested with RsaI and run through Qiagen Qiaquick PCR purification columns to remove
adapter sequences added during the PCR Select procedure. 25 ng cDNA was labelled with 33P using a
commercial kit following the manufacturer's instructions (Promega, Prime-a-gene kit), and
unincorporated label was removed using BioRad Biospin-6 columns after adding 2.5 ug yeast tRNA
carrier.

Pre-hybridisation, hybridisation and washes were performed essentially according to the Research
Genetics GeneFilters protocol, but supplementing the hybridisation mixture with 10 ug of a cocktail of
oligonucleotides complementary to the Clontech PCR Select nested PCR primers (equimolar mix of
primers 1 and 2R and their reverse complements).
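
Deriving the reverse complement of a primer, as needed for a blocking cocktail of this kind, can be sketched as below. The nested primer sequences themselves are not reproduced in this section, so the beta actin sense primer quoted earlier is used purely as example input; the function name is an illustrative assumption.

```python
# Translation table for the four DNA bases (upper and lower case)
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


# Example input only: the beta actin sense primer given earlier in the text
beta_actin_sense = "TCACCCACACTGTGCCCATCTACGA"
print(reverse_complement(beta_actin_sense))
```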



Hybridized arrays were exposed to X-ray film or to a phosphorimager (Molecular
Dynamics, Storm), and clones showing gross differences in the hybridization signals with hypoxic
compared to normoxic cDNA probes were identified. This procedure was used to process all clones
originally picked from the primary libraries and PCR amplified. The selected clones were grouped
together onto a single array (referred to here as a secondary array), and were re-screened with cDNA
probes which had not been subtracted, to allow a more quantitative, though less sensitive, evaluation of
the relative abundance of the genes in hypoxia vs. normoxia.

In this case, probes were ds cDNA generated from the Clontech SMART cDNA synthesis kit (labelled
using the Promega Prime-a-gene kit) or were total RNA (labelled according to the Research Genetics
GeneFilters protocol), and hybridisations were done according to the Research Genetics GeneFilters
protocol.

Hybridization signals were measured using a phosphorimager and were processed with ArrayVision
(Imaging Research Inc) software, using multiple beta-actin spots to normalise the quantitation and
individual spot background correction. At this stage, the inserts of clones showing consistent
up-regulation in hypoxia were sequenced using the 2R primer.

The identity of the genes was determined using BLAST at the NCBI (NLM, NIH) against the
non-redundant database collection. Where significant matches to human genes were not made, the human
EST database was used. For both EST and non-EST hits, identifier numbers were also obtained from the
UniGene database.

The above strategy was used independently for libraries derived from macrophages and from
cardiomyoblasts. By screening a relatively large number of clones (several thousand per library), single
genes were identified from multiple clones from any individual library. Multiple clones covered either the
same or different regions of the genes.

In the above manner, certain hypoxia-inducible genes were identified from clones only derived from the
cardiomyoblast library. These genes are listed in Table 1. Certain hypoxia-inducible genes were identified
from clones only derived from the macrophage libraries. These genes are listed in Table 2. Certain
hypoxia-inducible genes were identified from clones derived from both macrophage and myoblast
libraries. These genes are listed in Table 3.

It can be seen that Table 3 contains far fewer genes than either Table 1 or Table 2, demonstrating that these
cell types have large differences in the genes induced by hypoxia. Importantly, the subtracted libraries for
macrophage and cardiomyoblast were constructed in parallel. Therefore, major differences in the
spectrum of genes isolated from these libraries are likely to be due to differences in the starting material,
rather than due to technical differences in the production of the libraries. In addition, the genes contained
in these tables were confirmed to be hypoxia-regulated in the relevant cell type(s) by the described
two-stage array hybridisation screening process.

From Table 3 it is clear that although this subset of genes was found in subtracted libraries from both
hypoxic macrophages and cardiomyoblasts, the fold-induction obtained between hypoxia and normoxia
differs widely between the two tissues. For the first 5 genes in this table, the hypoxia response is greater
for macrophages, whereas for the last 2 genes it is greater for cardiomyoblasts.
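
The comparison described here, fold-induction by hypoxia in one cell type set against fold-induction in the other, can be sketched as follows. This is an illustration under stated assumptions rather than the procedure actually used: the 2-fold margin for calling a response "greater" is hypothetical (no cutoff is given in the text), as are the function names, gene names and signal values.

```python
def fold_induction(normoxia, hypoxia):
    """Ratio of hypoxic to normoxic signal for one gene in one cell type."""
    return hypoxia / normoxia


def compare_cell_types(signals, min_ratio=2.0):
    """Report which of two cell types shows the stronger hypoxia response.

    signals: gene -> {cell_type: (normoxia, hypoxia)} with exactly two cell types.
    min_ratio: hypothetical margin by which one fold-induction must exceed the other.
    """
    result = {}
    for gene, by_cell in signals.items():
        (cell_a, sig_a), (cell_b, sig_b) = by_cell.items()
        fold_a, fold_b = fold_induction(*sig_a), fold_induction(*sig_b)
        if fold_a >= min_ratio * fold_b:
            result[gene] = cell_a
        elif fold_b >= min_ratio * fold_a:
            result[gene] = cell_b
        else:
            result[gene] = "similar"
    return result


# Hypothetical signals, for illustration only
example = {
    "gene_x": {"macrophage": (1.0, 8.0), "cardiomyoblast": (1.0, 2.0)},
    "gene_y": {"macrophage": (1.0, 2.0), "cardiomyoblast": (1.0, 10.0)},
    "gene_z": {"macrophage": (1.0, 3.0), "cardiomyoblast": (1.0, 4.0)},
}
print(compare_cell_types(example))
```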

To test whether genes isolated only in the macrophage-derived subtracted libraries are not responsive to
hypoxia in cardiomyoblasts, cardiomyoblast cDNA isolated from normoxic and hypoxic cells was
hybridised to an array of macrophage-derived clones. These data are presented as a scatter plot, showing
normalised signal intensities in hypoxia versus normoxia, with each dot representing a single gene on the
array. This plot is presented in Figure 1. A gene that is not affected by hypoxia will localise around the
y=x line, running diagonally through the centre of the graph. From the figure, it can be seen that most
genes lie in this region, even though all the genes were responsive to hypoxia in the macrophage (Table
2). There is a subset of genes that lie beneath this region (x>y), representing induction of these genes by
hypoxia in the cardiomyoblast.
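
Reading a scatter plot of this kind amounts to asking how far each gene lies from the y=x line. The sketch below classifies a gene from its pair of normalised intensities. The 2-fold band around the diagonal is an assumed threshold chosen for illustration (the text does not state one), and the function name is hypothetical.

```python
import math


def classify_by_scatter(normoxia, hypoxia, threshold=2.0):
    """Classify a gene by its distance from the y=x line on a log2 scale.

    threshold: hypothetical fold change treated as the edge of the
    'unchanged' band around the diagonal.
    """
    log_ratio = math.log2(hypoxia / normoxia)
    cutoff = math.log2(threshold)
    if log_ratio >= cutoff:
        return "induced"
    if log_ratio <= -cutoff:
        return "repressed"
    return "unchanged"


# Hypothetical normalised intensities, for illustration only
print(classify_by_scatter(100.0, 110.0))
print(classify_by_scatter(100.0, 400.0))
```

Working on a log scale treats a 2-fold increase and a 2-fold decrease symmetrically, which a raw difference would not.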

Sequence data for the cDNA inserts of clones from our custom subtracted cDNA libraries is available. 
These are usually short fragments of 300-1000 bp. Some have been resequenced to obtain an accurate full 
insert sequence (see document "gene sequences/analysis"). 

Several of the genes presented in Tables 1-3 encode hypothetical proteins of unknown function and others
have no database matches with protein coding sequence. The work presented here provides some
functional annotation for these genes, and potential applications for the treatment of disease. Certain
genes, in particular the glycolytic enzymes and transporters, have been hypothesised previously as
forming part of the generic hypoxia response. The data provided herein provide solid, validating data for
these hypotheses.

It was surprising to note that cells from our cultures of human ventricle-derived cells, showing a
cardiomyoblast-like phenotype, do not support significant induction of the following genes: Lactate
dehydrogenase A, Enolase 1, Phosphoglycerate kinase 1, Triosephosphate isomerase 1. These genes have
been identified as being targets of the "ubiquitous" transcription factor HIF-1 alpha ("HIF-1: mediator of
physiological and pathophysiological responses to hypoxia" J. Appl. Physiol. 88: 1474-1480 (2000)).

Example 1b: Preparation of custom array

To confirm the findings presented in Example 1a, and to obtain more accurate and additional data, both
the subtracted cDNA library clones and the IMAGE clones identified from the Research Genetics Human
GeneFilters have now been fabricated by the authors into an independently produced and verified gene
array (referred to herein as the "custom gene array"), composed of PCR-amplified insert DNA. The
methods used to produce this array are common in the art, but the key points are summarised below.

Clones from the subtracted cDNA library were PCR amplified as defined in Example 1a. In many cases,
there were multiple cDNA clones corresponding to different regions of the same gene, and all these were
represented on the custom gene array. IMAGE clones were obtained from the UK MRC HGMP Resource
Centre (Hinxton, Cambridge CB10 1SB, UK) and were re-isolated as individual colonies and sequenced
to verify the correct identity of the clone. In the majority of cases, the same IMAGE clone identified from
the Research Genetics Human GeneFilters was selected, but in some instances these clones were not
available and alternatives were selected, corresponding to the same gene.

Additional genes, with well-defined roles in various disease processes relevant to hypoxia, were also
represented on the array, as derived from IMAGE clones. It is well established in the literature that genes
with similar functions are often co-regulated at the mRNA level, as determined by microarray data
clustering methods (Iyer VR et al, Science, 1999 283(5398):83-7; Eisen MB et al Proc Natl Acad Sci U S
A. 1998 95(25):14863-8). This allows associations to be made between genes of unknown function (as
present in the current specification) and genes of well defined function, in order to add significance to the
former.

Normalisation is a key issue in array analysis. The custom gene array is a single colour type array, and
contains a selection of additional IMAGE clones corresponding to genes which were empirically
determined not to be affected by hypoxia and which are highly expressed in a wide range of human
tissues and cell types. During data analysis, spot intensities were divided by the mean of all the reference
genes shown below, each of which was present in quadruplicate on each array.

Gene                                                       IMAGE clone Acc.

FLJ11102 fis clone PLACE1005646                            AA464704
matrix Gla protein                                         W74725
guanine nucleotide binding protein alpha stimulating 1     AA669443
DKFZp434A1319                                              R43581
cDNA FLJ23280 fis clone HEP07194                           AA155913
beta actin                                                 (in house clone)
EF1a-like protein                                          W91881
ribosomal protein L37a                                     AI817566
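
The per-chip normalisation described above, in which every spot intensity is divided by the mean of all reference gene spots on the same array, can be sketched as follows. This is a minimal illustration; the function name, data layout and intensity values are hypothetical, and only two of the reference genes are included to keep the example short.

```python
from statistics import mean


def per_chip_normalise(spots, reference_genes):
    """Divide every spot intensity by the mean of all reference gene spots.

    spots: gene -> list of replicate spot intensities from a single array.
    reference_genes: names of the genes used as the normalisation reference.
    """
    ref_values = [v for gene in reference_genes for v in spots[gene]]
    ref_mean = mean(ref_values)
    return {gene: [v / ref_mean for v in values] for gene, values in spots.items()}


# Hypothetical intensities, for illustration only (reference genes in quadruplicate)
array_spots = {
    "beta actin": [100.0, 100.0, 100.0, 100.0],
    "ribosomal protein L37a": [300.0, 300.0, 300.0, 300.0],
    "gene_x": [50.0, 150.0],
}
normalised = per_chip_normalise(array_spots, ["beta actin", "ribosomal protein L37a"])
print(normalised["gene_x"])
```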



IMAGE clone plasmid miniprep DNA was prepared and PCR amplified with flanking vector primers of
the sequences GTTTTCCCAGTCACGACGTTG and TGAGCGGATAACAATTTCACACAG. This was
then purified and concentrated by ethanol precipitation, and the presence of a single band and DNA
concentration were determined by agarose gel electrophoresis and by digital imaging methods.

Purified PCR products corresponding to all the clones (IMAGE and non-IMAGE) were normalised to 0.5
mg/ml by dilution. Arrays were fabricated onto Hybond N+ (Amersham) membranes using a
BioRobotics TAS arrayer (Biorobotics, Cambridge CB3 7LW, UK) with a 500 micron pin tool. Using
384-well source plates and a 2x2 arraying format, this array was relatively low density, thereby
eliminating problems of spot-to-spot signal bleed. Also, the large pin size and high source plate DNA
concentration improve the sensitivity of detection. Post-arraying denaturation/neutralisation was
essentially as described by Bertucci F et al, 1999 (Oncogene 18: 3905-3912).
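
Normalising each PCR product to 0.5 mg/ml by dilution follows the usual C1 x V1 = C2 x V2 relation. A small sketch of that calculation is given below; the function name, the example stock concentration and the example volume are hypothetical.

```python
def diluent_volume(stock_conc, stock_volume, target_conc=0.5):
    """Volume of diluent to add so that a stock reaches target_conc.

    Concentrations share one unit (e.g. mg/ml); the result is in the same
    unit as stock_volume. Uses C1 * V1 = C2 * V2.
    """
    if stock_conc < target_conc:
        raise ValueError("stock is already below the target concentration")
    final_volume = stock_conc * stock_volume / target_conc
    return final_volume - stock_volume


# Hypothetical example: 10 ul of product at 2.0 mg/ml
print(diluent_volume(2.0, 10.0))  # 30.0 (ul of diluent to add)
```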

Total RNA was extracted from cells using RNeasy (Qiagen) and 7 micrograms RNA was labelled with
100 microCi 33P dCTP using 2 micrograms poly dT (10-20 mer) as primer in a reverse transcription
reaction. First strand RNA was then degraded under alkaline conditions, and this was then neutralised with
Tris HCl pH 8.0, and the labelled cDNA was purified using BioRad BioSpin-6 chromatography columns.
Pre-hybridisation was performed in 4 ml Research Genetics MicroHyb solution supplemented with
10 micrograms poly dA (10-20 mer) and 10 micrograms Cot-1 DNA, at 45 degrees for 2-3 hours. The
cDNA was then denatured by heating and added to the pre-hybridisation, which was continued for
18-20 hr. Washing steps were done as follows: 2xSSC/ 1% SDS 2x20 min at 50 degrees and 0.5xSSC/ 1%
SDS 10 min at 55 degrees. Arrays were exposed to Amersham Low Energy phosphor screens for 24 hr and
scanned using a phosphorimager at 50 micron resolution. Image analysis was done using ArrayVision
software (Imaging Research Inc). Tab delimited data files were exported and a full analysis performed
using GeneSpring software (Silicon Genetics).

Using the described methodology a dynamic range of detection of 4 logs and a sensitivity of at least 1 /
50,000 is obtained, as determined by spike doping titration experiments. Having several technical
differences compared to the Research Genetics Human GeneFilters as used in the initial filing, data from
the custom gene array is expected to be quantitatively different.

Example 1c: Hypoxia regulation of gene expression in macrophages by exposing cells to hypoxia +/-
additional signal amplification.

The transcription factor HIF-1 alpha is ubiquitously present in cells and is responsible for the induction of a
number of genes in response to hypoxia. This protein is considered a master regulator of oxygen
homeostasis (see, for example, Semenza, (1998) Curr. Op. Genetics and Dev. 8:588-594). Although
HIF-1 alpha is well known to mediate responses to hypoxia, other transcription factors are also known or suspected
to be involved. These include a protein called endothelial PAS domain protein 1 (EPAS1) or HIF-2 alpha,
which shares 48% sequence identity with HIF-1 alpha (Tian H, et al. Genes Dev. 1997 11:72-82). Evidence
suggests that EPAS1 is especially important in mediating the hypoxia response in certain cell types, and it
is clearly detectable in human macrophages, suggesting a role in this cell type (Griffiths et al., 2000, Gene
Ther., 7(3):255-62).

As supporting evidence for the hypoxic regulation of the genes contained within this specification,
adenoviral vectors were used to overexpress HIF-1 alpha and EPAS1 in primary human macrophages prior to
exposure to hypoxia, in order to amplify the response. Because the role of these transcription factors as
mediators of the hypoxia response is very well established, any further increases in the inducibility of
specific genes resulting from this approach represent credible supporting evidence that those genes are
responsive to hypoxia.

A commercially available system was used herein to produce adenoviral particles, involving the
adenoviral transfer vector AdApt, the adenoviral genome plasmid AdEasy and the packaging cell line
PER.C6 (Crucell, Leiden, The Netherlands). The standard manufacturer's instructions were followed.
Three derivatives of the AdApt transfer vector have been prepared, named AdApt ires-GFP, AdApt
HIF-1alpha-ires-GFP and AdApt EPAS1-ires-GFP. In these vectors, for convenience, AdApt was modified such
that inserted genes (i.e. HIF-1 alpha or EPAS1) expressed from the powerful cytomegalovirus (CMV)
promoter were linked to the green fluorescent protein (GFP) marker, by virtue of an internal ribosome
entry site (ires). Therefore the presence of green fluorescence provides a convenient indicator of viral
expression of HIF-1 alpha or EPAS1 in transduced mammalian cells. The control vector AdApt ires-GFP was
used to allow discrimination between effects of the inserted genes (i.e. HIF-1 alpha or EPAS1) and
potential non-specific effects of adenoviral transduction or GFP expression. Standard subcloning methods
were used to construct the adenoviral constructs as described in detail elsewhere (see co-pending,
co-owned International patent application PCT/GB01/00758; Example 2).

The adenoviral transfer vectors AdApt HIF-1α-ires-GFP and AdApt EPAS1-ires-GFP were verified, prior
to production of adenoviral particles, for their ability to drive expression of functionally active HIF-1α or
EPAS1 protein from the CMV promoter in mammalian cells. This was achieved by transient transfection
luciferase-reporter assays as described (Boast K et al. Hum Gene Ther. 1999 Sep 1;10:2197-208).

Using the aforementioned Introgene adenoviral system, caesium-banded, pure adenoviral particles were
produced for each of the vectors AdApt ires-GFP, AdApt HIF-1α-ires-GFP and AdApt EPAS1-ires-GFP.
Following the Introgene manual, adenoviral preparations were quantitated by spectrophotometry, yielding
values of viral particles (VP) per milliliter.

Primary human macrophages, isolated as described above, were washed and resuspended in DMEM
(Gibco, Paisley, UK) supplemented with 4% fetal bovine serum (Sigma). 5x10^6 cells were plated into
nine individual 10cm Primeria (Falcon) tissue culture dishes containing medium plus adenovirus as
shown below (experimental design), to give a total volume of 10 ml per plate. Two doses of adenovirus
were used: 5.3x10^8 viral particles / ml (low) and 1.6x10^9 viral particles / ml (high). These amounts were
chosen following a series of titration experiments. Following culture for 16 hr, during which the
macrophages adhere to the plate and are infected by the adenoviral particles, the medium was removed
and replaced by IMDM medium (Gibco, Paisley, UK) supplemented with 2% human AB serum. A
further 24 hr period of culture was allowed prior to experimentation, to allow gene expression from the
transduced adenovirus. Gene transduction was verified by visually assessing gfp expression, and
expression of the viral HIF-1α and EPAS1 genes was determined by real time quantitative RT-PCR using
an ABI Prism 7700 TaqMan and SYBR Green protocol. For the high doses of virus, the total levels of
HIF-1α or EPAS1 mRNA present in the transduced cells were increased by 10-30 fold.
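
The dosing arithmetic above can be illustrated with a short sketch. It assumes that 5x10^6 cells were plated per dish (the text could also be read differently) and that the stated concentrations apply to the full 10 ml volume; the function and constant names are hypothetical and chosen only for illustration.

```python
# Minimal sketch: viral particles per plate and per cell for the two doses.
# Assumption (not stated explicitly in the text): 5e6 cells per dish.
CELLS_PER_PLATE = 5e6
VOLUME_ML = 10.0
DOSES_VP_PER_ML = {"low": 5.3e8, "high": 1.6e9}


def particles_per_plate(vp_per_ml, volume_ml=VOLUME_ML):
    """Total viral particles added to one plate."""
    return vp_per_ml * volume_ml


def particles_per_cell(vp_per_ml, volume_ml=VOLUME_ML, cells=CELLS_PER_PLATE):
    """Nominal viral particles available per plated cell."""
    return particles_per_plate(vp_per_ml, volume_ml) / cells


per_cell = {dose: particles_per_cell(vp) for dose, vp in DOSES_VP_PER_ML.items()}

for dose, value in per_cell.items():
    print(f"{dose}: {value:.0f} viral particles per cell")
```

Under these assumptions the two doses differ by roughly three-fold in particles per cell, which is the ratio of the stated concentrations.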

For experimentation with conditions of hypoxia, identical culture dishes were divided into two separate
incubators: one at 37 degrees, 5% CO2, 95% air (=Normoxia; equivalent to 20% Oxygen) and the other
at 37 degrees, 5% CO2, 94.9% Nitrogen, 0.1% Oxygen (=Hypoxia). After 6 hours culture under these
conditions, the dishes were removed from the incubator, placed on a chilled platform, washed in cold PBS
and total RNA was extracted using RNeasy (Qiagen) following the manufacturer's instructions.



Experimental design

Condition   Adenovirus (type)        Adenovirus amount            Oxygen (%)
                                     (low = 5.3x10^8 vp/ml,
                                      high = 1.6x10^9 vp/ml)
1           none                     none                         20
2           AdApt ires-GFP           low                          20
3           AdApt ires-GFP           high                         20
4           AdApt ires-GFP           low                          0.1
5           AdApt ires-GFP           high                         0.1
6           AdApt HIF-1α-ires-GFP    low                          0.1
7           AdApt HIF-1α-ires-GFP    high                         0.1
8           AdApt EPAS1-ires-GFP     low                          0.1
9           AdApt EPAS1-ires-GFP     high                         0.1


RNA samples from the experimental conditions shown above were each hybridised to individual copies
of the Custom gene array and processed as described earlier. To ensure reproducible data, this was
repeated so each RNA sample was hybridised to 4 separate arrays. Therefore a total of 36 arrays were
used for this experiment. Data analysis was done taking the mean signal of each spot from the four array
replicates of each RNA sample. When displayed graphically, standard error of the mean is displayed as
the error bar. Expression values were calculated so that they represent the fold-change ratio as compared
to condition #1, i.e. untreated cells.
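
As an illustration of the replicate handling described above, the following sketch computes the mean and standard error of the mean for one spot across four replicate arrays and expresses each condition as a fold change relative to condition 1. The signal values and all names are hypothetical, and scaling the error bar by the reference mean is a simplifying assumption that ignores the uncertainty of the reference itself.

```python
from math import sqrt
from statistics import mean, stdev


def summarise_spot(replicates):
    """Return (mean, standard error of the mean) for one spot across replicate arrays."""
    return mean(replicates), stdev(replicates) / sqrt(len(replicates))


# Hypothetical signals for a single spot: condition number -> four replicate arrays.
signals = {
    1: [100.0, 110.0, 90.0, 100.0],   # untreated cells (reference)
    4: [210.0, 190.0, 200.0, 200.0],  # control virus, hypoxia
    7: [820.0, 780.0, 800.0, 800.0],  # HIF-1 alpha virus (high dose), hypoxia
}

reference_mean, _ = summarise_spot(signals[1])

results = {}
for condition, replicates in signals.items():
    spot_mean, spot_sem = summarise_spot(replicates)
    # Fold change relative to condition 1; the SEM is scaled by the same reference mean.
    results[condition] = (spot_mean / reference_mean, spot_sem / reference_mean)

# Nine RNA samples, each hybridised to four arrays.
n_arrays = 9 * 4

for condition, (fold, sem) in sorted(results.items()):
    print(f"condition {condition}: fold change {fold:.2f} +/- {sem:.2f}")
```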

For genes shown in Table 4 it can be seen that in cells transduced by the control adenovirus AdApt
ires-GFP there is a response to hypoxia (conditions 4,5) as compared to normoxia (conditions 2,3).
However, this response is significantly greater when the natural hypoxia response is amplified by
overexpression of HIF-1α from the adenovirus AdApt HIF-1α-ires-GFP (conditions 6,7).
Furthermore, this effect is usually dependent on the amount of HIF-1α overexpression (i.e. greater in
condition 7 compared to 6). The same data are displayed graphically in Figure 2. It can be seen that these
genes encode metallothionein proteins. One of these (Nucleotide Seq ID No. 84; Protein Seq ID No. 83)
is a novel member of the metallothionein family. Several metallothionein genes are known in the art to be
activated by hypoxia, supporting the usefulness of these data.

For genes shown in Table 5 and Figure 3 it can be seen that in cells transduced by the control adenovirus
AdApt ires-GFP there is a response to hypoxia (conditions 4,5) as compared to normoxia (conditions
2,3). However, this response is significantly greater when the natural hypoxia response is amplified by
overexpression of EPAS1 from the adenovirus AdApt EPAS1-ires-GFP (conditions 8,9).

In the case of the protein encoded by Seq ID No. 24, results are available independently for two separate 
cDNA clones representing non-overlapping regions of the same full length gene. 

In the case of the protein encoded by Seq ID No. 86 (EGL nine (C.elegans) homolog 3), additional
evidence is described above in support of the function of this protein. Furthermore, real time quantitative
RT-PCR analysis of this gene, using an ABI Prism 7700 TaqMan and SYBR Green protocol, has been
performed to verify and more accurately quantitate the upregulation of EGL nine (C.elegans) homolog 3
in response to hypoxia and EPAS1 adenoviral overexpression. The main difference between the
array-based and real time quantitative RT-PCR methodologies is that the latter is far more sensitive and
therefore can detect expression in the off-state (here normoxia) for weakly expressed genes. These data
showed an induction ratio of 819-fold for EGL nine (C.elegans) homolog 3 in response to hypoxia with
additional EPAS1 expression, from RNA generated from an independent experiment. These data were
normalised to beta actin.



Similarly, the induction of another weakly-expressed EPAS1-induced gene, Semaphorin 4b (Seq ID No.
91/92; see additional discussion above), has been determined using real time quantitative RT-PCR
methodology, showing an actin-normalised induction ratio of 30.1 (data not shown).

For the gene shown in Table 6 and Figure 4 it can be seen that in cells transduced by the control
adenovirus AdApt ires-GFP, there is a negative response to hypoxia (conditions 4,5) as compared to
normoxia (conditions 2,3). However, this response is significantly greater when the natural hypoxia
response is amplified by overexpression of HIF-1α or EPAS1 (conditions 6,7,8,9).

Example 2: Differences in the hypoxia responses of resting and activated macrophages.

Macrophages accumulate at hypoxic areas in various disease states, including cancer, rheumatoid
arthritis, atherosclerosis and wound healing. At these sites macrophage activation is liable to occur, such
as in response to T-cell derived gamma interferon. For instance, in atherosclerotic plaques there is an
accumulation of both T-cells and macrophages, and these are known to interact with one another
(reviewed in Lusis AJ, Atherosclerosis. Nature. 2000 Sep 14;407(6801):233-41).

It is well established that the macrophage has a significant role in the pathology of the above diseases
involving hypoxia, and that most functions of the macrophage (including inflammatory functions) are
greatly increased following activation. Therefore any therapeutic strategy aimed at the hypoxic
macrophage also needs to consider the effects of macrophage activation and possible cross talk between
the responses to macrophage activation and hypoxia.

2.1: Research Genetics Human GeneFilters 

This work was carried out using Research Genetics Human GeneFilters, which contain DNA derived
from clones of the IMAGE cDNA collection, representing genes of varying degrees of characterisation. A
series of 6 arrays of human genes were used (GeneFilters GF200-205), potentially covering a total of
31,104 genes. Generally, single genes are represented only once in these arrays. However, sometimes
IMAGE clones initially thought to represent separate genes were found, upon re-analysis, to be different
regions of the same gene. Here we have presented data for all clones individually, though they possess the
same UniGene ID and gene name. An example is Hypothetical protein FLJ20037.

The methodology for Research Genetics arrays is similar in principle to that described for the array
screening of clones from subtracted libraries. There are several attributes to this method: relatively small
amounts of RNA can be labelled to make cDNA probes in a single step reaction, and probes are labelled
with the same chemical group (33P), so there are no errors introduced as a result of using different dyes,
which may differ in stability etc. Using a Phosphorimager allows detection over a wide range of
intensities (over 4 logs). Overall it is interesting to note a recent study, which has favourably re-evaluated
the performance of the nylon-based array, as compared with the glass-based microarray method (Bertucci
F et al, Hum Mol Genet 8:1715-1722 (1999)).

Experiments were done essentially as described in the Research Genetics GeneFilters protocol. Duplicate
copies of each array, from the same production batch, were used and hybridised in parallel with labelled
RNA isolated from normoxic and hypoxic primary human macrophages. Hybridised arrays were scanned
twice using a Molecular Dynamics Storm phosphorimager, and both images were analysed to ensure
reproducibility. Furthermore, the experiments were repeated using the same RNA samples, but with
different array lot numbers, again to ensure reproducibility.

Analysis was performed using Research Genetics Pathways software, with normalisation using the 'all
data points' option. Analyses were output as spreadsheets and filtered to remove data points where the
signal intensity was less than 4-fold above the general background for the experimental condition with the
higher signal (hypoxia or normoxia depending on whether hypoxia causes induction or repression).
Sometimes expression in the lower state was not significantly above background, and the ratio will
therefore be underestimated. Ratios were calculated as the normalised signal intensity in hypoxia divided
by that in normoxia. Changes were verified visually from the original array images.
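
The filtering and ratio steps described above can be sketched as follows. This is only an illustration under stated assumptions: it is not the Pathways software, the gene names, signal values and background levels are hypothetical, and a single general background value per condition is assumed.

```python
def filter_and_ratio(spots, background_hypoxia, background_normoxia, min_fold=4.0):
    """Keep spots whose higher-signal condition is at least `min_fold` times the
    general background of that condition, and return hypoxia/normoxia ratios.

    `spots` maps a gene name to (normoxia_signal, hypoxia_signal), both normalised.
    """
    ratios = {}
    for gene, (normoxia, hypoxia) in spots.items():
        # Use the condition with the higher signal: hypoxia for induced genes,
        # normoxia for repressed genes.
        if hypoxia >= normoxia:
            higher, background = hypoxia, background_hypoxia
        else:
            higher, background = normoxia, background_normoxia
        if higher < min_fold * background:
            continue  # too close to background to be reliable
        ratios[gene] = hypoxia / normoxia
    return ratios


# Hypothetical normalised signals: gene -> (normoxia, hypoxia).
example_spots = {
    "gene_A": (50.0, 400.0),   # induced, well above background
    "gene_B": (30.0, 60.0),    # induced, but below the 4-fold cut-off
    "gene_C": (500.0, 100.0),  # repressed, well above background in normoxia
}

ratios = filter_and_ratio(example_spots, background_hypoxia=20.0, background_normoxia=20.0)
print(ratios)
```

Note that when the lower-state signal is itself near background, as in the text, the computed ratio is a lower bound on the true induction or repression.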

In this manner, comparisons were made between normoxia and hypoxia in resting macrophages. The
whole procedure was then repeated for activated macrophages, to investigate possible differences in the
response to hypoxia. It is possible that potential differences for certain genes could be correlated with
changes in expression resulting from activation, prior to challenge with hypoxia. To explore this
possibility, comparisons were made between resting and activated macrophages, both in normoxia. Since
some of the genes we have identified as being activated by hypoxia have very low hybridisation signals in
normoxia (for both resting and activated macrophages), this comparison was not possible for those genes.

We have found various patterns of gene expression changes occurring in response to hypoxia, related to
the activation state of macrophages, which are presented below. Such a range of responses, specific to
various subsets of genes, was not expected, and contradicts the view that the hypoxia response is largely a
generic mechanism.

Table 7 shows genes that are induced by hypoxia to a similar degree in resting and activated
macrophages.

Table 8 shows genes that are induced by hypoxia to a greater degree in resting macrophages, as compared 
to activated macrophages. These data are presented illustratively in Figure 5. 

Data from Table 8/Figure 5 reveal several unexpected observations.

A) From the final column it can be seen that macrophage activation, in the absence of hypoxia, causes
induction of many of these genes. This suggests that the signalling pathways resulting from
activation and hypoxia might converge on a single transcriptional regulator, causing macrophage
activation to pre-empt the response to subsequent hypoxia. This is exemplified most strikingly for
Interleukin 8, which is dramatically induced in response to macrophage activation, but shows no
additional response to hypoxia.

B) Genes in rows 11, 13 and 14 have no response to hypoxia following macrophage activation,
though there is not a preceding large increase in expression in response to macrophage activation
alone. This suggests that in the activated macrophage, the necessary signalling pathway or
transcriptional regulator is not functional.

C) Although Table 8 was produced electronically, without selecting genes based on their names, it can 
be seen that genes encoding proteins of the metallothionein family feature strongly. 

Table 9 shows genes which are induced by hypoxia to a greater degree in activated macrophages,
compared to resting macrophages. These data are presented illustratively in Figure 6.

In Table 7, there are several genes for which hypoxia/normoxia ratios were only obtained for activated
macrophages, such as Cox-2 (see row 47). For these genes, macrophage activation usually increases
expression of the gene to detectable levels, thus allowing the study of subsequent changes in response to
hypoxia. It is likely that these genes are not significantly expressed in resting macrophages irrespective of
hypoxia, and therefore the hypoxia response is probably specific to activated macrophages.

Certain genes respond to hypoxia by decreasing mRNA expression (repression), and these genes therefore
have hypoxia/normoxia ratios of < 1.0. This phenomenon is known in the field of hypoxia, although the
mechanism is obscure. Data are presented in Tables 7-9 which unexpectedly show that this
hypoxia-induced repression for specific genes is not a generic process, but is dependent on the cellular
context. In Table 10/Figure 7, genes are presented that are hypoxia-repressed to a greater degree in
activated (column 7) compared with resting (column 8) macrophages. Prior to any hypoxic challenge, these
genes are induced to varying degrees in response to macrophage activation (column 9), suggesting a shared
mechanism for these separate responses. From Table 10, genes in rows 1-6 show that macrophage
activation is necessary to obtain any response to hypoxia. In resting macrophages, these genes are not
responsive to hypoxia at all.

Strikingly, Table 10/Figure 7 shows that seven separate genes encoding chemokine proteins (Monocyte
chemotactic protein 1, Macrophage inflammatory protein 1b, Monocyte chemotactic protein 3, Small
inducible cytokine A3, Monocyte chemotactic protein 2, Macrophage inflammatory protein 2a and
Macrophage inflammatory protein 2 precursor) are more strongly repressed in activated macrophages as
compared to resting macrophages. These genes are also among the most inducible in response to
activation alone, in normoxia (column 9). These findings are of potential utility in view of the great
significance of chemokines to inflammatory disease. For example, macrophage chemotactic factor 1
(Table 10, row 19) is key to the pathological role of the macrophage in atherosclerosis ("Chemokines and
atherosclerosis" Reape TJ and Groot PHE, Atherosclerosis 147: 213-225, 1999).

Genes in rows 20-30 of Table 10 were not detectably expressed in resting macrophages, irrespective of
hypoxia. Table 11 shows other genes that were down-regulated in response to hypoxia in macrophages.

Example 3: Tissue-specific hypoxia regulation of gene expression by an analysis of a series of
primary human cell cultures.

Equivalent cultures of non-immortalised, non-transformed primary human cells of 10 distinct types were
cultured in normoxia or were exposed to hypoxia for 6 hr and 18 hr, and gene expression changes
were determined. To the inventors' knowledge, this is the first time that such a study has been reported.
Moreover, unlike the vast majority of information in the public domain relating to genes responsive to
hypoxia, all of these cells were human and were cultured without any modifications following isolation
from the human donors. By using primary cells rather than cell lines or immortalised cultures, the
findings of this work more accurately represent the situation in the human body.

Most cell types were obtained from Clonetics (distributed by BioWhittaker, Walkersville, MD) and
cultured according to the manufacturer's recommendations, unless otherwise shown. #1:adipocyte
(Clonetics CC-2568; derived from subcutaneous adult adipose tissue), #2:cardiomyocyte (Clonetics
CC-2582; derived from fetal tissue; prior to experimentation cultured in minimal medium: DMEM, 4% Horse
serum), #3:endothelial (TCS CellWorks ZHC-2101 human umbilical vein endothelial cells), #4:fibroblast
(Clonetics CC-2511 dermal fibroblasts derived from adult tissue), #5:hepatocyte (Clonetics CC-2591,
derived from adult tissue), #6:macrophage (derived from human blood as described elsewhere in the
specification), #7:mammary epithelial (Clonetics CC-2551; derived from adult tissue), #8:monocyte
(derived from human blood as described elsewhere in the specification but without the 7 day
differentiation culture period), #9:neuroblastoma (neuroblastoma-derived cell line SH-SY5Y), #10:renal
epithelial (Clonetics CC-2556; derived from fetal tissue), #11:skeletal muscle myocyte (Clonetics
CC-2561; derived from adult tissue). A non-primary cell type (#9) was used to represent neurons, since
primary human neurons are difficult to source. Therefore a total of 11 cell types are compared. It should
be noted that RNA from hepatocytes at the 18hr timepoint of hypoxia was not available for this work.

Genes which were induced or repressed preferentially in particular cell type(s) were identified by
hybridisation of the RNA samples to the custom gene array, as described in Examples 1b and 1c. Each
RNA sample was hybridised to duplicate or triplicate arrays, to ensure reproducible data, and was
analysed using GeneSpring software. Data from replicate arrays were merged during analysis to generate
mean values. Data normalisation was achieved per-array using the aforementioned list of control genes,
such that differences in RNA labelling or hybridisation due to experimental variation were corrected by
referencing each gene to the mean value of the reference genes on the same array. Also, for each gene,
expression values were obtained which represent the value in each experimental condition (e.g.
macrophages 6hr hypoxia) as compared to the median value of that gene throughout the full range of
experimental conditions (i.e. from all cell types). This transformation does not alter the relative values of
any gene between the different experimental conditions, and is done since there is no obvious single
reference experimental condition to create ratio values. This is common in microarray data analysis.
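
The two normalisation steps described above (per-array referencing to control genes, then per-gene referencing to the median across conditions) can be sketched as below. This is a minimal illustration, not the GeneSpring implementation; the gene names and values are hypothetical.

```python
from statistics import mean, median


def normalise_array(raw_signals, control_genes):
    """Divide every gene on one array by the mean signal of the control genes."""
    reference = mean(raw_signals[gene] for gene in control_genes)
    return {gene: value / reference for gene, value in raw_signals.items()}


def median_centre(values_by_condition):
    """Express one gene's value in each condition relative to its median over all conditions."""
    gene_median = median(values_by_condition.values())
    return {cond: value / gene_median for cond, value in values_by_condition.items()}


# Hypothetical raw signals from one array.
array_raw = {"control_1": 100.0, "control_2": 300.0, "gene_X": 400.0}
array_norm = normalise_array(array_raw, ["control_1", "control_2"])

# Hypothetical per-array-normalised values of gene_X in three conditions.
gene_x = {"fibroblast normoxia": 1.0, "monocyte normoxia": 2.0, "monocyte hypoxia 6hr": 8.0}
gene_x_centred = median_centre(gene_x)

# Dividing by a constant leaves ratios between conditions unchanged.
ratio_before = gene_x["monocyte hypoxia 6hr"] / gene_x["fibroblast normoxia"]
ratio_after = gene_x_centred["monocyte hypoxia 6hr"] / gene_x_centred["fibroblast normoxia"]
print(array_norm["gene_X"], gene_x_centred, ratio_before, ratio_after)
```

The last two values illustrate the point made in the text: median-centring rescales a gene's profile without changing its relative values between conditions.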

Table 12 shows the full dataset of this analysis. From this it can be seen that certain genes respond to
hypoxia differently, depending on the particular cell type. This information is valuable in identifying
biological targets for the development of therapeutic and diagnostic products. Not only does it indicate a
particularly significant role for these genes in the specific cell type implicated in a disease, but it also
indicates that a therapeutic product directed at such a target is less likely to produce problematic
toxicological effects. Data shown in Table 12 and the derived figures are reproducible, and are an accurate
determination of mRNA expression levels. This may be confirmed by independent means, such as
quantitative real time RT-PCR.

Certain genes from Table 12 will be presented for illustration. 

Genes with a greater response in monocytes or macrophages

Since monocytes and macrophages are similar cell types, the latter derived from the former, they will be 
analysed together. 

Expression profiles of 11 genes showing hypoxia-induced changes in gene expression which are most
pronounced in monocytes or macrophages are shown in Figures 8-18. These genes correspond to:

Seq ID:339/340 CYP1 (cytochrome P450, subfamily XXVIIB)
Seq ID:357/358 interleukin 1 receptor antagonist
Seq ID:375/376 Regulator of G-protein signalling 1
Seq ID:389/390 GM2 ganglioside activator protein
Seq ID:405/406 Alpha-2-macroglobulin
Seq ID:475/476 Ecotropic viral integration site 2A
Seq ID:433/434 high affinity immunoglobulin epsilon receptor beta (CFFM4)
Seq ID:431/432 Pleckstrin
Seq ID:469/470 cytokine effector of inflammatory response SCYA3L
Seq ID:79/80 Novel PI-3-kinase adapter
Seq ID:21/22 Hypothetical protein PRO0823

It will be appreciated that the majority of these genes have a known biological function in immunity/
inflammation, consistent with the known function of the monocyte/macrophage. Further to this
knowledge, these data indicate that, in hypoxic disease sites where monocytes/macrophages make up a
significant proportion of the cell types, such as rheumatoid arthritis synovial membranes, these
genes are possible therapeutic targets.

Ecotropic viral integration site 2A (Seq ID:475/476) 

For example, the gene illustrated in Figure 8, Ecotropic viral integration site 2A (Seq ID:475/476), is
induced in hypoxic monocytes to a level over 25 times higher than the median expression level of this
gene throughout the other cell types. This gene, of unknown function, is located on chromosome 17q11.2,
close to genes with immune functions. Presented elsewhere in this specification are data showing that
expression of Ecotropic viral integration site 2A is downregulated in response to the inflammatory
cytokine interferon gamma. These novel data provide evidence that Ecotropic viral integration site 2A is a
novel target for inflammatory conditions involving hypoxia and monocytes.

Novel PI-3-kinase adapter Seq ID:79/80 Clone p1E9 (EST accession R62339).

Another example, in Figure 9a, is Seq ID:79/80 (EST accession R62339). It is seen that in hypoxic
macrophages, this gene is expressed at 6-fold higher levels than the median expression level of this gene
throughout the other cell types. Therefore, the levels of the encoded protein in hypoxic monocytes/
macrophages, as found at various disease sites, are likely to be higher than in other cell types not involved
in the disease process or present at the site of disease. This illuminates a novel utility of this gene as a
target for the development of therapeutic products for diseases involving monocytes/macrophages and
hypoxia.

The data that led to the generation of this Figure are as follows:

Cell type            Oxygen         Normalised expression
                                    (clone p1E9 / Seq ID:79/80)
adipocyte            normoxia       1.54
adipocyte            hypoxia 6hr    0.89
adipocyte            hypoxia 18hr   1.48
cardiomyocyte        normoxia       1.18
cardiomyocyte        hypoxia 6hr    1.80
cardiomyocyte        hypoxia 18hr   1.53
endothelial          normoxia       0.68
endothelial          hypoxia 6hr    0.82
endothelial          hypoxia 18hr   0.60
fibroblast           normoxia       0.60
fibroblast           hypoxia 6hr    0.64
fibroblast           hypoxia 18hr   0.73
hepatocyte           normoxia       0.92
hepatocyte           hypoxia 6hr    1.62
macrophage           normoxia       4.20
macrophage           hypoxia 6hr    3.97
macrophage           hypoxia 18hr   6.19
mammary epithelial   normoxia       0.25
mammary epithelial   hypoxia 6hr    0.42
mammary epithelial   hypoxia 18hr   0.18
monocyte             normoxia       2.33
monocyte             hypoxia 6hr    3.63
monocyte             hypoxia 18hr   5.01
neuroblastoma        normoxia       0.93
neuroblastoma        hypoxia 6hr    0.80
neuroblastoma        hypoxia 18hr   0.85
renal epithelial     normoxia       0.57
renal epithelial     hypoxia 6hr    0.61
renal epithelial     hypoxia 18hr   0.61
skeletal myocyte     normoxia       1.58
skeletal myocyte     hypoxia 6hr    1.37
skeletal myocyte     hypoxia 18hr   1.17



To substantiate the array-based data, the same RNA samples were examined by real time quantitative
RT-PCR. The advantages of this method are that it is more sensitive and, because two gene-specific
primers are used, the data will be more specific to the gene in question.

RNA from the above samples (except for the hepatocyte RNA, which was unavailable) was DNase
I-treated prior to reverse transcription to remove possible contaminating genomic DNA, and was reverse
transcribed using an oligo dT(15) primer and SuperScript II reverse transcriptase. These samples were
used as template for PCR reactions using primers specific to EST accession R62339 or to beta-actin.
Primer sequences were as follows:

Novel PI-3-kinase adapter Seq ID:79/80 Clone p1E9 (EST accession R62339)
Forward Primer 5' GCC CTT ACT TTT TCA CTT CTT CGT 3'
Reverse Primer 5' CCT TAA GAT CCA TTC TCA TTG CTG AT 3'

Beta-actin
Forward Primer 5' GCC CTG AGG CAC TCT TCC A 3'
Reverse Primer 5' GCG GAT GTC CAC GTC ACA 3'

All RT-PCR reactions were performed using an ABI Prism 7700 Sequence Detector system. For each
Q-PCR run, a master mix was prepared with 2x SYBR Green I master mix (Applied Biosystems) and
primers at 5 µM. Two microlitres of the respective diluted cDNA were added to the PCR master mixture,
giving a total volume of 25 µL. The thermal cycling conditions comprised 50°C for 2 minutes, 95°C for
10 minutes, and 40 cycles of 95°C for 15 seconds and 60°C for 1 minute. PCR reactions were set up in
96 well format with duplicate amplifications for each data point, including 8 serial cDNA dilutions (0.2,
0.1, 0.05, 0.025, 0.01, 0.005, 0.001 and 0.0001) of macrophage treated with 18 hours hypoxia to compose
a standard curve, a no template control, a no amplification control lacking reverse transcriptase, and each
cDNA sample at a dilution value of 0.1. The experiment for the novel PI3K adapter was carried out in
triplicate to assess reproducibility, which was later determined by linear regression analysis. Data were
analysed, with necessary adjustment of the default baseline and threshold line, using ABI Prism 7700
software. The Ct value, the key raw datum for each sample, was calculated as the cycle number at which
the ΔRn crosses the threshold. For each run, a standard curve was constructed by plotting the mean Ct
values from the 8 data points of the standard sample against the log input of the corresponding dilution
values, with a best fit trend line. From the trend line, the formula 'y=mx+c' was created according to the
y-intercept and slope of the standard curve, which were then used for calculating the log input amount of
the experimental cDNA samples, as related to the calibration sample. Data for the Novel PI-3-kinase
adapter were normalized to those of beta-actin to correct for potential differences in efficiency of cDNA
synthesis between the RNA samples.
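
The standard-curve calculation described above can be sketched as follows. This is an illustrative reconstruction, not the ABI Prism 7700 software: the Ct values are hypothetical and generated to lie exactly on a line, a separate standard curve per primer pair is an assumption, and all function names are invented for the example.

```python
from math import log10

# Serial dilutions of the standard sample, as listed in the text.
DILUTIONS = [0.2, 0.1, 0.05, 0.025, 0.01, 0.005, 0.001, 0.0001]


def fit_standard_curve(dilutions, ct_values):
    """Least-squares fit of Ct = m * log10(dilution) + c; returns (m, c)."""
    xs = [log10(d) for d in dilutions]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ct_values) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ct_values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def relative_input(ct, slope, intercept):
    """Invert y = mx + c to get the input amount relative to the standard sample."""
    return 10 ** ((ct - intercept) / slope)


# Hypothetical mean Ct values for the standard dilutions of each primer pair.
target_standard_ct = [20.0 - 3.3 * log10(d) for d in DILUTIONS]
actin_standard_ct = [15.0 - 3.4 * log10(d) for d in DILUTIONS]

target_curve = fit_standard_curve(DILUTIONS, target_standard_ct)
actin_curve = fit_standard_curve(DILUTIONS, actin_standard_ct)

# Hypothetical Ct values measured for one experimental cDNA sample.
sample_target_ct = 20.0 - 3.3 * log10(0.05)
sample_actin_ct = 15.0 - 3.4 * log10(0.1)

target_amount = relative_input(sample_target_ct, *target_curve)
actin_amount = relative_input(sample_actin_ct, *actin_curve)

# Normalise the target gene to beta-actin for the same sample.
normalised_expression = target_amount / actin_amount
print(round(normalised_expression, 3))
```

A slope close to -3.32 corresponds to a doubling of product per cycle, so the fitted slope also gives a quick check on amplification efficiency.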

From the TaqMan data, the specificity to monocytes and macrophages found from the array data is
confirmed and found to be even more pronounced (see Figure 9b). The data presented in the Figure are
listed below. In the data listed below, the normalized expression values are multiplied by 1000 for clarity.





Cell type            Oxygen         Normalised expression
                                    (clone p1E9 / Seq ID:79/80)
adipocyte            normoxia       0.050
adipocyte            hypoxia 6hr    0.007
adipocyte            hypoxia 18hr   0.015
cardiomyocyte        normoxia       0.163
cardiomyocyte        hypoxia 6hr    0.037
cardiomyocyte        hypoxia 18hr   0.222
endothelial          normoxia       3.093
endothelial          hypoxia 6hr    0.059
fibroblast           normoxia       0.527
fibroblast           hypoxia 6hr    0.043
fibroblast           hypoxia 18hr   0.037
macrophage           normoxia       404.593
macrophage           hypoxia 6hr    503.026
macrophage           hypoxia 18hr   1162.056
mammary epithelial   normoxia       0.026
mammary epithelial   hypoxia 6hr    0.068
mammary epithelial   hypoxia 18hr   0.112
monocyte             normoxia       565.471
monocyte             hypoxia 6hr    657.465
monocyte             hypoxia 18hr   979.048
neuroblastoma        normoxia       8.482
neuroblastoma        hypoxia 6hr    7.104
neuroblastoma        hypoxia 18hr   4.707
renal epithelial     normoxia       17.898
renal epithelial     hypoxia 6hr    9.831
renal epithelial     hypoxia 18hr   10.929
skeletal myocyte     normoxia       0.930
skeletal myocyte     hypoxia 6hr    0.638
skeletal myocyte     hypoxia 18hr   1.627

There are several technical reasons why the cell-type specificity seen in the array-based data might be
more pronounced in the TaqMan results. The lower sensitivity of the array-based method means that
genes which are not expressed will still be detected as a background signal. Also, the array method is more
likely to suffer from cross-hybridisation between similar genes.

The TaqMan data illustrate dramatically the concept that the hypoxia response is not just a generic
response found in all cell types, relating to generic cell processes such as metabolism.

Database searches for gene sequences showing identity with IMAGE clone acc:R62339 reveal that there
are no matching human sequences of any type other than ESTs. This includes full length cDNAs,
truncated cDNAs, gene sequences from chromosomal data or hypothetical protein gene sequences.
Therefore the human gene represented by IMAGE clone acc:R62339 is a novel human gene.

Although this human EST is unannotated, by comparison with mouse sequence data (acc AF293806), it
appears likely to encode a novel human phosphoinositide 3-kinase (PI3-kinase) adapter molecule,
homologous to the recently described mouse gene, BCAP. This class of molecule, involved in
intracellular signalling, has been shown to have utility as a drug target (see Stein RC et al "PI3-kinase
inhibition: a target for drug development" Mol Med Today, 2000 Sep;6(9):347-57). PI3-kinases are key to
many cellular processes relevant to human disease, including proliferation, apoptosis and inflammation.
The data presented for the gene encoded by Seq ID:79/80 provide evidence that the encoded protein is a
novel drug target in humans, specifically targeting monocytes/macrophages at hypoxic disease sites.

In the publication relating to murine BCAP, the protein is identified as an adapter molecule connecting
the non-receptor protein tyrosine kinase Syk to the p85 subunit of PI3-kinase, and therefore to the pivotal
signalling pathways centred around PI3-kinase (Okada T et al "BCAP: the tyrosine kinase substrate that
connects B cell receptor to phosphoinositide 3-kinase activation." Immunity, 2000 13:817-27). Although,
in this report, Syk is acting as the intracellular signalling component of the B cell antigen receptor, which
is present exclusively on B-cells, Syk has been shown to initiate intracellular signalling from other cell
surface receptors which are expressed on macrophages, including the Fc gamma receptor, the chemokine
receptor CCR5 and macrophage-expressed CD8 (Darby C et al "Stimulation of macrophage Fc gamma
RIIIA activates the receptor-associated protein tyrosine kinase Syk and induces phosphorylation of
multiple proteins including p95Vav and p62/GAP-associated protein". J Immunol 1994 152:5429-37),
(Kedzierska K et al "FcgammaR-mediated phagocytosis by human macrophages involves Hck, Syk, and
Pyk2 and is augmented by GM-CSF." J Leukoc Biol 2001 Aug;70(2):322-8.), (Ganju RK et al
"Beta-chemokine receptor CCR5 signals through SHP1, SHP2, and Syk." J Biol Chem. 2000 275:17263-8.),
(Lin TJ et al "Activation of macrophage CD8: pharmacological studies of TNF and IL-1 beta
production." J Immunol 2000 164:1783-92.).

Indeed, Syk has been validated as a target in macrophages to inhibit inflammatory activities of this cell type
(Stenton GR et al "Aerosolized Syk antisense suppresses Syk expression, mediator release from
macrophages, and pulmonary inflammation." J Immunol 2000 Apr 1;164(7):3790-7.).

In addition to the finding that the probable human orthologue of the adapter molecule BCAP is
preferentially hypoxia-induced in human monocytes/macrophages, we also find from data generated by
the custom array that the protein acting immediately upstream of BCAP (i.e. Syk) is also regulated by
hypoxia in this novel cell-type-specific manner, greatly increasing the biological significance of the
original finding (see Figure 9c). The data used to generate this Figure are presented below for clarity.
Cell type             Oxygen          Normalised expression (of Syk)

adipocyte             normoxia        2.6573591
adipocyte             hypoxia 6hr     1.499927
adipocyte             hypoxia 18hr    1.1115488
cardiomyocyte         normoxia        0.8357341
cardiomyocyte         hypoxia 6hr     2.161058
cardiomyocyte         hypoxia 18hr    0.90880114
endothelial           normoxia        0.60265505
endothelial           hypoxia 6hr     0.56874704
endothelial           hypoxia 18hr    0.43321633
fibroblast            normoxia        0.8542026
fibroblast            hypoxia 6hr     0.7657573
fibroblast            hypoxia 18hr    0.784982
hepatocyte            normoxia        0.5238476
hepatocyte            hypoxia 6hr     0.8465495
macrophage            normoxia        4.272981
macrophage            hypoxia 6hr     6.144931
macrophage            hypoxia 18hr    10.278416
mammary epithelial    normoxia        1.1023632
mammary epithelial    hypoxia 6hr     2.7382789
mammary epithelial    hypoxia 18hr    0.7985004
monocyte              normoxia        6.052118
monocyte              hypoxia 6hr     8.6809225
monocyte              hypoxia 18hr    11.58468
neuroblastoma         normoxia        1.0230793
neuroblastoma         hypoxia 6hr     1.089154
neuroblastoma         hypoxia 18hr    0.7689335
renal epithelial      normoxia        0.88565326
renal epithelial      hypoxia 6hr     1.2609364
renal epithelial      hypoxia 18hr    0.6242461
skeletal myocyte      normoxia        1.3959162
skeletal myocyte      hypoxia 6hr     0.91255134
skeletal myocyte      hypoxia 18hr    0.64795935



In summary, we have shown here that a novel human gene encoding a predicted signalling protein
relevant to human disease is activated by hypoxia, specifically in monocytes and macrophages. These data
are validated by non-array-based means. Furthermore, we identify the protein immediately upstream of this
signalling system as being co-regulated in this manner too. Therefore the human PI3-kinase adapter
encoded by IMAGE clone acc:R62339 and the non-receptor tyrosine kinase Syk are both identified here
for the first time as therapeutic targets for diseases involving hypoxic macrophages, including
rheumatoid arthritis, chronic occlusive pulmonary disease, atherosclerosis and cancer. Because both
genes are preferentially expressed in hypoxic macrophages, toxicity effects of therapeutic products
directed at the encoded proteins are likely to be limited.
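
The cell-type specificity summarised above can be checked directly from the Syk values tabulated earlier. The following is a minimal sketch only: it copies four of the eleven cell types from that table, and the helper name `fold_change` is introduced here purely for illustration and is not part of the original analysis.

```python
# Normalised Syk expression read from the table above, as
# (normoxia, hypoxia 6hr, hypoxia 18hr). Only four cell types are copied
# here to keep the sketch short.
syk = {
    "adipocyte":   (2.6573591, 1.499927, 1.1115488),
    "endothelial": (0.60265505, 0.56874704, 0.43321633),
    "macrophage":  (4.272981, 6.144931, 10.278416),
    "monocyte":    (6.052118, 8.6809225, 11.58468),
}


def fold_change(values):
    """Return hypoxia/normoxia ratios for the 6hr and 18hr time points."""
    normoxia, hyp6, hyp18 = values
    return hyp6 / normoxia, hyp18 / normoxia


fold = {cell: fold_change(values) for cell, values in syk.items()}

for cell, (f6, f18) in fold.items():
    print(f"{cell:12s} 6hr: {f6:.2f}x   18hr: {f18:.2f}x")
```

On these figures the ratio rises above 1 at both time points only in monocytes and macrophages, while it falls below 1 in the adipocyte and endothelial samples.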

As discussed in detail above, fragments and functional equivalents of the PI3-kinase adapter protein
represented in Seq ID:79/80 and other equivalent proteins are included within the present invention, in
addition to ligands that bind specifically to these proteins. Furthermore, the invention also embraces
purified and isolated nucleic acid molecules encoding these proteins, fragments and functional
equivalents, vectors containing such nucleic acid molecules and host cells transformed with these vectors.

Regulator of G-protein signalling 1 (Seq ID:375/376)

Another intracellular signalling protein, Regulator of G-protein signalling 1 (RGS1; Seq ID:375/376), is
shown in Figure 10. Here the expression level in the hypoxic monocyte is 30-fold higher than the median
expression level of this gene throughout the other cell types. The function of this protein is to negatively
regulate G protein signalling pathways, and inhibit chemokine-induced cell migration of immune cells
(Moratz C et al J Immunol. 2000 164:1829-38 and Denecke B et al J Biol Chem. 1999 274:26860-8.).

Our data suggest that this gene is preferentially expressed in macrophages, consistent with the findings
of Denecke B et al (J Biol Chem. 1999 274:26860-8.). Our novel finding that expression is even further
enhanced by hypoxia illuminates a mechanism by which cell migration is inhibited in hypoxia, leading to
an accumulation of these cells at pathological sites of hypoxia. This mechanism is novel and distinct from
other mechanisms proposed in the art to explain this key aspect of hypoxia and inflammation (for
example: Grimshaw MJ et al "Inhibition of monocyte and macrophage chemotaxis by hypoxia and
inflammation-a potential mechanism." Eur J Immunol. 2001 31:480-9).

Furthermore, Figure 10 shows that Regulator of G-protein signalling 1 is upregulated during
differentiation of monocytes to macrophages, with significance to changes in cell motility. This discovery
therefore provides that inhibitors of RGS1 have utility in increasing the motility of macrophages that are
used for cell-based therapies. Accordingly, one embodiment of this aspect of the invention provides for
the use of an inhibitor of RGS1 in therapy, by increasing the motility of macrophage cells.

GM2 ganglioside activator protein 

The gene shown in Figure 11, GM2 ganglioside activator protein, was originally characterised as a
lysosomal co-factor required for degradation of gangliosides. It has been proposed to have alternative
roles as a secreted protein, and can bind and inhibit the actions of the inflammatory mediator, platelet
activating factor (Rigat B et al Biochem Biophys Res Commun. 1999 258:256-9).

Our novel finding, presented in Figure 11, shows that GM2 ganglioside activator protein is induced by
hypoxia, preferentially in macrophages, suggesting an influence on the inflammatory functions of the
macrophage in hypoxia.

In Figures 15-18, genes are shown which are expressed preferentially in the monocyte/macrophage, but
which are decreased in expression in response to hypoxia. Being expressed at highest levels in the
monocyte/macrophage, these genes are more likely to be significant to the biological functions of this
cell type.

Interleukin 1 receptor antagonist (Seq ID:357/358)

In Figure 15, the gene interleukin 1 receptor antagonist (Seq ID:357/358) is seen to be down-regulated by
hypoxia in the macrophage. Since the function of the encoded protein is anti-inflammatory,
down-regulation of this gene would be expected to have a pro-inflammatory effect. Therefore, corrective
expression of the gene would be expected to produce therapeutic effects in inflammatory disorders
involving macrophages and hypoxia, such as rheumatoid arthritis (Hollander AP et al. Arthritis Rheum.
2001 44:1540-4). This correlates with effects seen from the application of the drug Anakinra / Kineret™
developed by Amgen. This supports the applicability of the genes disclosed herein as novel targets for
therapeutic products.

The example of the gene interleukin 1 receptor antagonist also provides good exemplification of the concept
that different cell types respond to hypoxia differently. Here, not only are there quantitative differences,
but also qualitative differences, in that this gene is down-regulated by hypoxia in macrophages, but
up-regulated by hypoxia in several other cell types, such as renal epithelial cells (see Figure 15). Such
findings are not documented in the art.

The dataset of Table 12 also contains genes which are induced preferentially in monocytes/macrophages
and also in some but not all other cell types tested. Several of these genes are present as multiple clones
on the gene array, giving separate data, therefore adding extra confidence to the conclusions. These genes,
presented in Figures 19-28, correspond to:

SeqID:313/314 adipophilin 

SeqID:163/164 Hypothetical protein FLJ13511 

SeqID:267/268 Osteopontin

SeqID:17/18 Hematopoietic Zinc finger protein 

SeqID:137/138 CYP1B1 

SeqID:325/326 CYP1B1 

It will also be seen that in the case of CYP1B1 (clones p1F16 and p1E3) the hypoxia response in
monocytes/macrophages is qualitatively different to the other cell types tested, in that the gene is
up-regulated rather than down-regulated in response to hypoxia.

Genes with a greater response in endothelial cells 

The dataset of Table 12 also contains genes which are induced preferentially in endothelial cells, a cell
type key to the process of angiogenesis, in response to hypoxia. These genes are as follows, and are
presented in Figures 29-31:

SeqID:205/206 Hypothetical protein FLJ22690
SeqID:65/66 cDNA DKFZp586E1624
SeqID:197/198 EST

Genes with a greater response in hepatocytes 

The dataset of Table 12 also contains genes which are induced preferentially in hepatocytes, in response
to hypoxia. These genes are presented in Figures 32a and 33-38. It is noted that most of these genes,
including hqp0376, encode proteins of the metallothionein family. Furthermore, close inspection of these
data reveals that the fold induction in hypoxia compared to normoxia for monocytes/macrophages is
very high, though the absolute levels of expression are below those of hepatocytes.

SeqID:85/86 EGL nine (C.elegans) homolog 3 

SeqID:83/84 Novel Metallothionein 

SeqID:337/338 Hypothetical protein, hqp0376 (a metallothionein)

SeqID:265/266 Metallothionein 2A

SeqID:243/244 Metallothionein 1G 

SeqID:141/142 Hepcidin antimicrobial peptide 

SeqID:239/240 Metallothionein 1H 

EGL nine (C.elegans) homolog 3 

As described above, it has been discovered that a polypeptide encoded by a gene identified from the EST
recited in SEQ ID No 86, having the Protein accession number BAB15101 (encoded by Homo sapiens
cDNA: FLJ21620 fis, clone COL07838 Nucleotide accession AK025273) is regulated by hypoxia. Other
public domain sequences corresponding to this gene include Homo sapiens cDNA: FLJ23265 fis, clone
COL06456 Nucleotide accession AK026918. Accordingly, when referring in the present specification to
the EST recited in SEQ ID No 86, it is intended that these gene and protein sequences are also embraced.
This gene was identified using Research Genetics Human GeneFilters arrays, which contain an EST
corresponding to the gene (accession number R00332). The gene is now termed EGL nine (C.elegans)
homolog 3.

There are no reports that describe the function of this human gene. However, a high degree of amino acid 
homology is observed between the protein encoded by this gene, and a rat protein called "Growth factor 
responsive smooth muscle protein" or "SM20" (Nucleotide accession U06713; Protein accession 
A53770). An alignment of single letter amino acid sequences is shown below. Over the highlighted 
region there is 97% amino acid similarity and 96% amino acid identity. 



A53770   (1)   MTLRSRRGFLSFLPGLRPPRRWLRISKRGPPTSHWASPALGGRTLHYSCR
BAB15101 (1)   -

               51                                             100
A53770   (51)  SQSGTPFSSEFQATFPAFAAKVARGPWLPQWEPPARLSASPLCVRSGQA
BAB15101 (1)

               101                                            150
A53770   (101) LGACTLGVPRLGS VSEMPLGHimLDLEkl ALEYI VPCLHEVGFC YLDNFi
BAB15101 (1)   MPLGH1MRLDLEKIALEYIVPCLHEWFC YLDNF

               151                                            200
A53770   (151) ^GEWGDCVLERWQL
BAB15101 (35)  LGEWGDC^ERWQLHCTGALRDGQLAGPFAGVSKRHLRGDQITWIGGN

               201                                            250
A53770   (201) feEGCEAINPLLSLIIDRL
BAB15101 (85)  lEEGCEA J SFLL SLI DRLVLYCGSRLGKYl^\n<ERSKAM\ ? AC Y PGNGTGYVR

               251                                            300
A53770   (251) HVDNPNGDGRCXTCIY
BAB15101 (135) HVDN PNGDGRC I TC IYY^Nl^

               301                                            350
A53770   (301) jLLFSWSDRRNPHEVQPSYAT
BAB15101 (185) JL.LFFWSDRRNPHE VQP S YATRYAMT W YFDAE ERAE AKKKFRNLTRKTE S

               351
A53770   (351) EAI©
BAB15101 (235) ^TEti



The high degree of amino acid similarity suggests that the human protein BAB15101 has an equivalent
biochemical function to the rat protein A53770 ("Growth factor responsive smooth muscle protein" or
"SM20"). Recent publications have shown that SM20 functions to promote apoptosis in neurons
(Lipscomb et al., J Neurochem 1999; 73(1):429-32; Lipscomb et al., J Biol Chem 2000 Nov 1; [epub
ahead of print]). Significantly, SM20 has been shown to be expressed at high levels in the heart (Wax et
al., J Biol Chem 1994; 269(17):13041-7).
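
For readers who wish to reproduce an identity figure of the kind quoted for the alignment above, the calculation can be sketched as follows. This is an illustrative sketch only: the function name `percent_identity` and the two toy fragments are hypothetical and are not the A53770 or BAB15101 sequences; the convention of ignoring gapped columns is also an assumption, since the specification does not state how its percentages were derived.

```python
def percent_identity(seq_a: str, seq_b: str, gap: str = "-") -> float:
    """Percentage of aligned, ungapped columns in which both residues match."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == gap or b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy aligned fragments (hypothetical, for illustration only).
toy_a = "ACDEFG-HIK"
toy_b = "ACDQFGWHIK"
identity = percent_identity(toy_a, toy_b)
print(f"{identity:.1f}% identity over ungapped columns")
```

Amino acid similarity, as opposed to identity, would additionally require a substitution matrix or residue grouping, which is not attempted here.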

It has also been discovered that a polypeptide encoded by a gene identified from the EST recited in SEQ
ID No 90, having the Protein accession number CAB81622, is regulated by hypoxia. The encoding
human gene has been annotated in the UniGene database as "Similar to rat smooth muscle protein
SM-20"; the nucleotide sequence is contained within the nucleotide accession AL117352. More recently, a
longer fragment of this gene has been cloned, named C1orf12, or EGLN1 (Nucleotide accession
AAG34568; Protein accession AAG34568). Accordingly, when referring in the present specification to
the EST recited in SEQ ID No 90, it is intended that these gene and protein sequences are also embraced.

This distinct human gene, encoding a protein related to SM20 and EGLN3 (BAB15101), is also induced
in response to hypoxia. This gene was identified using Research Genetics Human GeneFilters arrays,
which contain an EST corresponding to the gene (accession number H56028).

Independently of this, a fragment of this gene has been cloned from a cDNA library derived from hypoxic
human cardiomyoblasts, and it has been shown that the gene is increased in expression in response to
hypoxia in this cell type (see Table 1 herein; penultimate row). The nucleotide sequence of this cDNA
fragment is referred to herein as SEQ ID No 90a.

In the light of the novel discovery reported herein that these human equivalents of SM20 are induced by
hypoxia, it is herein proposed that in cardiac ischaemia, the resulting apoptosis is due, at least in part, to
increased expression of these genes. The therapeutic modulation of the activity of EGLN3 (BAB15101),
C1orf12 (AAG34568), CAB81622, SM20 and other equivalent proteins and encoding genes therefore
provides a novel means for the treatment of myocardial ischaemia, through the alteration of the
propensity of myocardial cells to undergo apoptosis. For example, a suitable treatment may involve
altering the susceptibility of ischaemic myocardial tissue to subsequent reperfusion and re-oxygenation,
or may involve modulating the susceptibility of chronic ischaemic myocardial tissue (including forms of
angina) to later more severe ischaemia, which would result in myocardial infarction. It is submitted that,
by way of analogy, cerebral ischaemia may be treated using the same principle.

Although the Applicant does not wish to be bound by this theory, the downstream effects of SM20 and
related genes such as EGLN3 (BAB15101), C1orf12 (AAG34568), and CAB81622, namely apoptosis
and angiogenesis, might be explained as follows. The apoptotic effect of NGF withdrawal may be
mediated by induction of the hypoxia pathway, but may be an aspect of the supposed involvement of the
HIF protein in the stress response. HIF1α is induced by reactive oxygen species (see Richard et al. J Biol
Chem 2000 Sep 1;275(35):26765-71). This could, in turn, be mediated by over-load of the proteasomal
pathway for HIF1α degradation and the consequent accumulation of undegraded HIF1α. Accordingly, it
is considered that modulation of SM20 and the related genes EGLN3 (BAB15101), C1orf12
(AAG34568), and CAB81622 may have applications in the treatment of diseases resulting from
disturbances in proteasome function, such as prion diseases and other neurodegenerative diseases.

These data provide the first connection between these related genes and the physiological response to
hypoxia. Recently published research papers have identified that the protein products of these genes can
act as proline hydroxylases (see Bruick RK et al Science. 2001 294:1337-40 and Epstein AC et al Cell.
107:43-54). This is consistent with our observations that certain proline hydroxylases are induced in
response to hypoxia and that the genes EGLN1 and EGLN3 are part of the hypoxia response. For example,
two genes encoding proline hydroxylases have been identified herein as being increased in expression in
response to hypoxia (proline 4-hydroxylase, alpha polypeptide I; SeqID:231/232, proline 4-hydroxylase,
alpha polypeptide II; SeqID:349/350). This identifies a functional significance of proline hydroxylation
as a response to hypoxia.

Proline hydroxylase activity leads to degradation of HIF1α in normoxia (HIF regulates its own degradation
by feedback). Binding of VHL to hydroxylated HIF1α leads to ubiquitination and consequent degradation
of HIF1α by the proteasome. The activity of the prolyl hydroxylase is O2-dependent, so under conditions
of hypoxia, HIF1α is not hydroxylated efficiently and is stabilised. HIF1α protein thus accumulates to a
high level. The hypoxia-induction of the prolyl hydroxylase ensures that when the O2 concentration returns
to normal, there is sufficient enzyme available to target this high level of HIF1α efficiently for rapid
degradation.

Degradation of HIF1α is dependent on HIF1-induced transcription (i.e. is hypoxia inducible). Berra et al
(FEBS Lett 2001 Feb 23;491(1-2):85-90) raise the specific hypothesis of an unknown hypoxia-inducible
factor which targets HIF1α for proteasomal degradation. It appears reasonable to propose that this factor
will be hypoxia-inducible, to ensure that a rapid and effective constraint on the hypoxic response
would operate on return to normoxia. It now appears that the genes EGLN1 and EGLN3 form part of this
mechanism.

It is also hypothesised that SM20 and the related genes EGLN3 (BAB15101), C1orf12 (AAG34568), and
CAB81622 may act as tetramers. Known prolyl hydroxylases such as prolyl 4-hydroxylase (P4H) are
known to act as tetramers of two alpha subunits and two beta subunits. SM20 and the related genes
exhibit high similarity to the alpha subunit of P4H and it therefore seems likely that SM20 and the
related genes have a binding partner that is equivalent to the beta subunit of P4H.

SM20 has been shown to bind to the transcription factor HIF1α, and shares a low level of homology with a
p53 binding protein. p53 is a transcription factor that is known to be involved in apoptosis. Accordingly,
it is proposed that in addition to binding to HIF1A, SM20 and the related genes EGLN3 (BAB15101),
C1orf12 (AAG34568), and CAB81622 may also bind and modify other transcription factors that are
involved in the hypoxic response such as EPAS and HIF3A, or other transcription factors such as p53, and
thereby influence apoptosis. This aspect of the invention thus provides dimeric and tetrameric forms of
the EGLN3 (BAB15101), C1orf12 (AAG34568), and CAB81622 proteins, preferably complexed with a
protein selected from the group consisting of HIF1α, p53 and a protein binding partner that is equivalent
to the beta subunit of P4H. Preferably, such dimers and tetramers are heterodimers/heterotetramers.




To provide further evidence that these related genes are a significant part of the hypoxia response,
additional expression data are presented here.

Expression profiles for these two genes are displayed with per-chip normalisation to correct for
differences in RNA labelling etc., but within each gene no further normalisation is done (i.e. no per-gene
normalisation), so the relative absolute expression levels of the two genes can be compared and Y-axis
units between separate graphs from the same experiment are comparable. These graphs are presented as
Figures 32b (C1orf12) and 32c (EGLN3).
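
The distinction between the two normalisation steps can be illustrated with a short sketch. The specification does not state the exact procedure, so the median-centring used below is an assumption, and the chip names, gene names and intensities are invented purely for illustration.

```python
from statistics import median

# Hypothetical raw signals: chip -> {gene: intensity}. All values are invented.
raw = {
    "normoxia": {"geneA": 200.0, "geneB": 50.0, "geneC": 100.0},
    "hypoxia":  {"geneA": 900.0, "geneB": 150.0, "geneC": 300.0},
}


def per_chip_normalise(chips):
    """Divide every signal on a chip by that chip's median signal (assumed method)."""
    out = {}
    for chip, signals in chips.items():
        centre = median(signals.values())
        out[chip] = {gene: value / centre for gene, value in signals.items()}
    return out


def per_gene_normalise(chips):
    """Divide each gene's values by that gene's median across chips (assumed method)."""
    genes = list(next(iter(chips.values())).keys())
    centres = {g: median([chip[g] for chip in chips.values()]) for g in genes}
    return {chip: {g: v / centres[g] for g, v in signals.items()}
            for chip, signals in chips.items()}


chip_norm = per_chip_normalise(raw)          # the step applied to the figures
gene_norm = per_gene_normalise(chip_norm)    # the step deliberately omitted
```

After the per-chip step alone, geneA (2.0 and 3.0) still reads several-fold higher than geneB (0.5 and 0.5), so absolute levels remain comparable between genes. The per-gene step rescales every gene around 1 and would remove exactly that information, which is why it is omitted when two genes are to be compared on a common axis.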

It can be seen from these Figures that both genes (C1orf12 and EGLN3) are inducible in response to
hypoxia in macrophages, whether activated by gamma interferon and lipopolysaccharide or de-activated
by treatment with interleukin-10. In macrophages the absolute expression level of C1orf12 appears to be
higher than that of EGLN3.

There is a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary to this, we 
show herein that genes are regulated by hypoxia to a greater degree in certain cell types, substantiating 
their utility in designing specific therapeutic products for diseases involving those cell types. 

From Figures 32a and 32d and the data presented below, differing expression profiles of the two related
genes C1orf12 and EGLN3 are apparent throughout the 11 tested cell types, though C1orf12 is generally
expressed at higher levels than EGLN3.



Cell type             Oxygen          mRNA expression          mRNA expression
                                      (C1orf12 SeqID:89/90)    (EGLN3 SeqID:85/86)

adipocyte             normoxia        0.0075                   0.0033
adipocyte             hypoxia 6hr     0.0091                   0.0027
adipocyte             hypoxia 18hr    0.0182                   0.0025
cardiomyocyte         normoxia        0.0067                   0.0019
cardiomyocyte         hypoxia 6hr     0.0381                   0.0023
cardiomyocyte         hypoxia 18hr    0.0201                   0.0026
endothelial           normoxia        0.0198                   0.0019
endothelial           hypoxia 6hr     0.0583                   0.0033
endothelial           hypoxia 18hr    0.0397                   0.0026
fibroblast            normoxia        0.0119                   0.0032
fibroblast            hypoxia 6hr     0.0260                   0.0046
fibroblast            hypoxia 18hr    0.0235                   0.0040
hepatocyte            normoxia        0.0075                   0.0080
hepatocyte            hypoxia 6hr     0.0074                   0.0146
macrophage            normoxia        0.0033                   0.0008
macrophage            hypoxia 6hr     0.0083                   0.0018
macrophage            hypoxia 18hr    0.0058                   0.0021
mammary epithelial    normoxia        0.0065                   0.0014
mammary epithelial    hypoxia 6hr     0.0137                   0.0055
mammary epithelial    hypoxia 18hr    0.0144                   0.0065
monocyte              normoxia        0.0027                   0.0006
monocyte              hypoxia 6hr     0.0084                   0.0014
monocyte              hypoxia 18hr    0.0080                   0.0016
neuroblastoma         normoxia        0.0344                   0.0011
neuroblastoma         hypoxia 6hr     0.1085                   0.0013
neuroblastoma         hypoxia 18hr    0.0551                   0.0020
renal epithelial      normoxia        0.0275                   0.0046
renal epithelial      hypoxia 6hr     0.0560                   0.0046
renal epithelial      hypoxia 18hr    0.0395                   0.0096
skeletal myocyte      normoxia        0.0088                   0.0029
skeletal myocyte      hypoxia 6hr     0.0277                   0.0035
skeletal myocyte      hypoxia 18hr    0.0245                   0.0038



For instance, in the hypoxic hepatocyte (6hr) the normalised expression values of EGLN3 and C1orf12 are
0.015 and 0.0074 respectively, i.e. EGLN3 is the dominant gene. In contrast, in the neuroblastoma cell
line SH-SY5Y, the normalised expression values of EGLN3 and C1orf12 after 6hr hypoxia are 0.0012 and
0.108 respectively, i.e. C1orf12 is the dominant gene by a large margin. These data demonstrate that
C1orf12 and EGLN3 are not constitutively expressed at an equal amount in different tissues, indicating
specificity of function. Therefore, it is considered that therapeutic products may be developed based on
these data, with the goal of modulating proline hydroxylation of target proteins (such as HIF1 alpha) in
specific tissues, based on the differing expression profile of C1orf12 and EGLN3 in those tissues.
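
The comparison made in the preceding paragraph can be repeated for any row of the table above. The sketch below copies the 6hr hypoxia values for four cell types from that table; the helper name `dominant_gene` is hypothetical and is introduced only for illustration.

```python
# (C1orf12, EGLN3) normalised mRNA expression after 6hr hypoxia, copied from
# the table above. Only four cell types are included to keep the sketch short.
hypoxia_6hr = {
    "hepatocyte":    (0.0074, 0.0146),
    "neuroblastoma": (0.1085, 0.0013),
    "macrophage":    (0.0083, 0.0018),
    "endothelial":   (0.0583, 0.0033),
}


def dominant_gene(c1orf12, egln3):
    """Return the more highly expressed gene and the higher/lower ratio."""
    if egln3 > c1orf12:
        return "EGLN3", egln3 / c1orf12
    return "C1orf12", c1orf12 / egln3


summary = {cell: dominant_gene(*values) for cell, values in hypoxia_6hr.items()}

for cell, (gene, ratio) in summary.items():
    print(f"{cell:14s} dominant: {gene:8s} ({ratio:.1f}-fold)")
```

Of the four cell types copied here, only the hepatocyte shows EGLN3 as the more abundant transcript, by roughly two-fold; in the neuroblastoma line C1orf12 exceeds EGLN3 by more than 80-fold.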

In Example 1b herein, genes were identified from a custom array which give a greater induction in
macrophages (by a factor of at least 1.5) when hypoxia is augmented by over-expression of HIF1 alpha or
EPAS from an adenovirus. The data from the HIF/EPAS over-expression work are presented herein in
Example 1c, but the data specifically relating to C1orf12 and EGLN3 are summarised in Figures 32e and 32f.
From these data it is apparent that EGLN3 (FLJ21620 fis, clone COL07838), but not C1orf12, is increased in
expression by the transcription factor EPAS1 but not by HIF1 alpha. This is apparent by comparing
experimental condition 9 (hypoxia with EPAS overexpression; expression value = 3.48) to that of 5
(hypoxia without EPAS overexpression; expression value = 1.65). This adds valuable information about
the mechanism of regulation of the gene encoding EGLN3.
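
As a worked check of the figures quoted above, the augmentation of EGLN3 expression by EPAS over-expression can be compared with the 1.5-fold criterion. This is a simple arithmetic sketch; the variable names are chosen here for illustration only.

```python
# EGLN3 expression values quoted in the text above.
hypoxia_with_epas = 3.48     # condition 9: hypoxia with EPAS overexpression
hypoxia_without_epas = 1.65  # condition 5: hypoxia without EPAS overexpression
THRESHOLD = 1.5              # minimum augmentation factor stated in the text

augmentation = hypoxia_with_epas / hypoxia_without_epas
exceeds_threshold = augmentation >= THRESHOLD

print(f"augmentation = {augmentation:.2f}-fold; "
      f"meets the 1.5-fold criterion: {exceeds_threshold}")
```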

To confirm these data, the RNA samples for experimental conditions 1, 3, 5, 7 and 9 (corresponding to the
high dose of adenovirus) were also measured using a different array-based methodology, the Affymetrix
GeneChip. The results of this experiment are presented in Figures 32g and 32h.

Functional Characterisation of EGL nine (C.elegans) homolog 3 role in the induction of Cardiomyocyte 
apoptotic cell death 

EGLN3 has been cloned into pONY8.1 and Smart2.IRES.GFP equine infectious anaemia virus (EIAV)
vectors, and AdCMV.TRACK.GFP (AdenoQuest) adenoviral genome vectors (see co-owned co-pending
International patent application PCT/GB01/00758). These vectors have been used in "gain-of-function"
studies in which EGLN3 has been overexpressed in order to elucidate corresponding protein function.
Human embryo kidney (HEK 293T) and dog osteosarcoma (D17) cell lines have been used in transient
plasmid transfection experiments to confirm EGLN3 expression from viral vector genomes. Rat
cardiomyocyte cell line (H9C2) and primary human neonatal cardiomyocytes (PHNC) (BioWhittaker,
CC2582) have been used in viral transduction experiments to determine the biological activity of EGLN3.
In all cell types, expression of EGLN3 has been followed by combinations of immunofluorescence,
Western blotting and TaqMan quantitative PCR. Immunofluorescence and Western blotting employ an
antibody specific for the FLAG epitope engineered into the 3' terminus of EGL nine (C.elegans) homolog
3 (Sigma, F3165). TaqMan quantitative PCR utilises the SYBR Green method (Applied Biosystems).

Western blotting has confirmed the transient expression of EGLN3 from an EIAV genome construct in
HEK 293T (expected size approx 717 bp, 26 kDa). Immunofluorescence has localised transient
expression of EGL nine (C.elegans) homolog 3 from the EIAV expression construct in HEK 293T to the
cytoplasm. Expression of EGL nine (C.elegans) homolog 3 is elevated after 4 hours exposure to hypoxic
conditions (0.1% (v/v) oxygen), when compared to expression observed under normoxia (20% (v/v)
oxygen) (see Figure 32i). TaqMan primers have been designed and optimised for the initial measurement
of EGL nine (C.elegans) homolog 3 expression in EIAV or Adenovirus transduced H9C2 and PHNC
(Forward: TCATCGACAGGCTGGTCCTC; Reverse: GTTCCATTTCCCGGATAGAA). All findings at
the RNA level are corroborated by immunofluorescence and Western blotting analyses at the protein
level.
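
Basic properties of the two primer sequences given above can be checked with a few lines of code. This is a minimal sketch: the function names `gc_content` and `reverse_complement` are introduced here for illustration, and no claim is made about how the primers were originally designed or evaluated.

```python
# Primer sequences as given in the text above.
FORWARD = "TCATCGACAGGCTGGTCCTC"
REVERSE = "GTTCCATTTCCCGGATAGAA"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    seq = seq.upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


for name, primer in (("forward", FORWARD), ("reverse", REVERSE)):
    print(f"{name}: length {len(primer)} nt, GC {gc_content(primer):.0f}%")

# The sense-strand sequence to which the reverse primer corresponds.
reverse_target = reverse_complement(REVERSE)
```

The reverse complement of the reverse primer gives the sense-strand sequence that would be searched for, together with the forward primer, when locating the amplicon in a target cDNA.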

EIAV transduction of H9C2 and PHNC has been optimised with constructs containing green fluorescent
protein (GFP) and LacZ reporter genes, using the VSVg envelope and a range of MOI between 10 and
100. GFP results were scored by fluorescence microscopy, while LacZ transductants were identified
through the assay of β-galactosidase activity. An MOI of 50 transduced approximately 50% of the cell
population.

EGLN3 is predicted to have pro-apoptotic activity in cardiomyocytes. Early, mid and late phase apoptosis
are characterised by translocation of membrane phospholipid phosphatidylserine (PS) from the inner face
of the plasma membrane to the cell surface, activation of specific proteases (caspases) and fragmentation
of DNA, respectively (Martin, S.J., et al., J. Exp. Med. 1995, 182, 1545-1556; Alnemri, E.S., et al., J.
Cell. Biochem. 1997, 64, 33-42; Wylie, A.H., et al., Int. Rev. Cytol. 1980, 68, 251-306). Translocation of
PS has been identified through use of ApoAlert kit (Clontech; K2025-1), which employs FITC-labelled
antibodies to detect surface expression of the PS, Annexin V. Caspase activity has been followed using
the homogeneous fluorimetric caspase assay (Roche; 3005372) which allows the quantification of caspase
activity through the cleavage of a fluorescent substrate. DNA fragmentation has been estimated using the
nuclear stain Hoechst 33345 (Sigma, B2261) and fluorescence microscopy to locate areas of chromatin
condensation. Total viability of the cell population has been quantified through measurement of the ability of
mitochondrial reductase to metabolise the fluorescent substrate MTT (Sigma, M2128) (Levitz S.M. &
Diamond, R.D. J. Infect. Dis. 1985 Nov; 152(5):938-45).

Conditions for early, mid and late stage apoptosis in H9C2 and PHNC have been defined using hypoxia
and nutrient-depleted growth medium to mimic those ischaemic conditions found in vivo (Brar, B.K., et
al., J. Biol. Chem. 2000, 275, 8508-8514). Transduction of PHNC with EIAV vectors containing EGLN3
is sufficient to cause an increase in caspase activity in cells cultured under normoxic conditions,
confirming the role of EGLN3 in the induction of cardiomyocyte apoptosis. Using an MOI of 50, a 2-fold
increase in caspase activity was seen in EGLN3 transduced cells, when compared to controls 48 hours
post transduction (see Figure 32j).

Increased expression of EGL nine (C.elegans) homolog 3 in transduced cells is confirmed by TaqMan,
immunofluorescence and Western blotting. Similar experiments are performed to determine whether EGL
nine (C.elegans) homolog 3 expression further sensitises H9C2 and PHNC to previously defined
ischaemic insults. Staurosporine (Calbiochem; 569397) and Smart2.IRES.GFP EIAV vectors containing
the Bax gene will be applied as chemical and viral pro-apoptotic controls, respectively (Yue, T-L., et al.,
J. Mol. Cell. Cardiol. 1998, 30, 495-507; Reed, J.C. J Cell Biol. 1994, 124(1-2):1-6).

Gene silencing approaches may be undertaken to down-regulate endogenous expression of EGLN3 in
PHNC to determine the degree of protection against apoptotic cell death provided by a reduction in
EGLN3 activity. RNA interference (RNAi) (Elbashir, S.M. et al., Nature 2001, 411, 494-498) is one
method of sequence-specific post-transcriptional gene silencing that may be employed. Short dsRNA
oligonucleotides are synthesised in vitro and introduced into a cell. The sequence-specific binding of
these dsRNA oligonucleotides triggers the degradation of target mRNA, reducing or ablating target
protein expression. A Hammerhead ribozyme library, contained in EIAV expression vectors, may also be
applied. Efficacy of both gene silencing approaches may be assessed initially through the measurement of
EGLN3 expression, at the RNA level by TaqMan and at the protein level by Western blotting. Protection
against previously described ischaemic insults provided by these methods of EGLN3 gene silencing may
be assayed biologically as detailed above. Caspase inhibitors (caspase 3 inhibitor V, 2129002 and caspase
inhibitor I, 627610, both Calbiochem) and Smart2.IRES.GFP EIAV vectors containing the Bcl-2 gene
may be applied as chemical and viral anti-apoptotic controls, respectively (Kroemer, G. Nat Med. 1997,
3(6):614-20).

Similar "gain-of-function" and gene silencing approaches will be applied to the related gene, encoded by
SEQ ID 90, named C1orf12.

Genes with a greater response in renal epithelial cells 

The dataset of Table 12 also contains genes which are induced preferentially in renal epithelial cells, in 
response to hypoxia. These genes are presented in Figures 39-44. 

SeqID:117/118 EST
SeqID:129/130 Hypothetical protein FLJ22622
SeqID:31/32 TRIP-Br2
SeqID:301/302 Tumor protein D52
SeqID:91/92/92a Semaphorin 4b
SeqID:371/372 Dec-1

For Semaphorin 4b (SeqID:91/92/92a), the clone presented in Figure 43 is p1P14, corresponding to
IMAGE clone acc BE910319, the sequence of which covers a large region of the gene including protein
coding sequence, which may cross-hybridise to other members of the semaphorin family. A separate
clone (p1D17), as found in the original filing, was derived from the subtracted library and corresponds to a
more unique region of this gene in the untranslated region. From Table 12 it will be appreciated that a
significant response is also found in the macrophage. This is validated by RNase protection assay data
(see Figure 57). Further characterisation of this gene using complementary experimental methods will
resolve the exact cell-type specific nature of its expression, though it is clear from these data
that it is induced in renal epithelial cells and macrophages.

Genes with a greater response in mammary epithelial cells

The dataset of Table 12 also contains genes which are induced preferentially in mammary epithelial cells,
in response to hypoxia. These genes are presented in Figures 45-52.

SeqID:447/448 Calgranulin A
SeqID:67/68 ERO1 (S. cerevisiae)-like
SeqID:25/26 Hypothetical protein FLJ20500
SeqID:229/230 N-myc downstream regulated
SeqID:387/388 Decidual protein induced by progesterone
SeqID:379/380 Integrin, alpha 5
SeqID:225/226 Tissue factor
SeqID:237/238 COX-2

In the case of Cox-2, which encodes a key drug target, it can be seen that in many cell types, especially
the mammary epithelial cells, there is a clear induction in response to hypoxia. In contrast, for endothelial
cells there is a very significant time-dependent decrease in Cox-2 gene expression in response to
hypoxia. Similarly, for Calgranulin A, there is strong positive induction in hypoxic mammary epithelial
cells, but in the macrophage, the response to hypoxia is negative. These clearly exemplify the unexpected
finding that cell types respond to hypoxia differentially, both quantitatively and qualitatively. This is
not currently known.

Hypoxia regulation of Novel human genes 

From Table 12, it will be appreciated that several genes with no prior annotation in public domain gene
sequence databases are now identified as being regulated by hypoxia in at least one cell type. To make
this clear, these genes have been copied from Table 12 and presented in Tables 13 and 14, showing the
hypoxia/normoxia induction ratio of the cell type in which the response is most pronounced. These
figures are derived by dividing the normalised expression value, as found in Table 5, in hypoxia by that in
normoxia for the same cell type. In some cases, where hypoxia causes inhibition of gene expression, the
fold change is prefixed by the term "DOWN". The cell type and time point of maximal response to
hypoxia are also noted in Tables 13 and 14. The main purpose of Tables 13 and 14 is to demonstrate that
these genes have significant responses to hypoxia per se.

In many cases, significant responses are seen in multiple cell types, though this is not apparent here.
In Table 13, the cDNA clones are currently un-annotated in public domain databases. In Table 14, the
cDNA clones are currently annotated, but were not so as at the priority date.
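The ratio described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, the "DOWN" entries are assumed to report the inverse ratio (normoxia divided by hypoxia), and the most pronounced response is assumed to be the one with the largest absolute log2 ratio. The specification does not state how these details were handled.

```python
import math


def induction_label(hypoxia: float, normoxia: float) -> str:
    """Format a hypoxia/normoxia ratio in the style described for Tables 13 and 14.

    Assumption: repressed genes are reported as "DOWN" followed by the
    inverse ratio (normoxia / hypoxia).
    """
    if hypoxia <= 0 or normoxia <= 0:
        raise ValueError("normalised expression values must be positive")
    ratio = hypoxia / normoxia
    if ratio >= 1.0:
        return f"{ratio:.1f}"
    return f"DOWN {1.0 / ratio:.1f}"


def most_pronounced(values: dict) -> tuple:
    """Return the (cell type, time point) key with the largest response.

    `values` maps (cell_type, time_point) to (hypoxia, normoxia) expression.
    Assumption: "most pronounced" means the largest absolute log2 ratio,
    so inductions and repressions are compared on the same scale.
    """
    return max(values, key=lambda k: abs(math.log2(values[k][0] / values[k][1])))


# Hypothetical values for one gene, for illustration only.
example = {
    ("macrophage", "6h"): (2.0, 1.0),
    ("endothelial", "18h"): (0.2, 1.0),
}
best = most_pronounced(example)
print(best, induction_label(*example[best]))
```

Comparing on a log scale is a design choice made here so that a 5-fold repression ranks above a 2-fold induction; a different ranking rule would change which cell type is reported.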

Example 4: Additional disclosure of the effect of macrophage activation on hypoxia regulation of
gene expression

In Example 2, it is shown that activated and resting macrophages respond to hypoxia in different ways,
showing that the hypoxia response is not a generic phenomenon. To consolidate this data, experiments
were performed with the custom array, using additional experimental conditions and with a more in-depth
analysis. Significantly, the expression values used are not simple hypoxia/normoxia ratios calculated
separately for macrophages of differing activation status; rather, the values used allow comparison of
the relative expression levels throughout the entire set of experimental conditions. Hence, for any gene,
all values throughout the entire set of experimental conditions are calculated by comparison to the median
level of that gene throughout the dataset. This allows a clearer appreciation of the effects of hypoxia in
the context of cell activation status. The following data demonstrate that, among the newly discovered genes
responsive to hypoxia, expression changes are also seen in response to key cytokines of the immune
system, implying functions outside of the generic response to hypoxia and metabolism. This especially
applies to unannotated genes, including ESTs and hypothetical proteins, showing potential functions in
inflammation and angiogenesis on the basis of cytokine-regulation.
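The per-gene median scaling described above can be illustrated as follows. This is a minimal sketch, not the analysis software actually used: the function name and the raw signal values are hypothetical, and the only step taken from the text is dividing each value for a gene by the median of that gene across all conditions.

```python
from statistics import median


def normalise_to_median(raw: dict) -> dict:
    """Express each condition's value relative to the gene's median.

    `raw` maps a condition label to a raw (per-chip corrected) signal for
    a single gene. A value of 1.0 after scaling means "at the median level
    of this gene across the whole experiment".
    """
    centre = median(raw.values())
    if centre <= 0:
        raise ValueError("median expression must be positive")
    return {condition: value / centre for condition, value in raw.items()}


# Hypothetical raw signals for one gene across the six conditions.
raw_signal = {
    "1 none/normoxia": 10.0,
    "2 none/hypoxia": 20.0,
    "3 IL-10/normoxia": 30.0,
    "4 IL-10/hypoxia": 50.0,
    "5 LPS+IFN/normoxia": 80.0,
    "6 LPS+IFN/hypoxia": 100.0,
}
scaled = normalise_to_median(raw_signal)
for condition, value in scaled.items():
    print(condition, round(value, 2))
```

Because every condition is divided by the same per-gene constant, ratios between any two conditions are unchanged, which is what allows hypoxia effects to be read in the context of activation status.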

Macrophages were derived and cultured as described elsewhere in the specification. A total of 6
experimental conditions were analysed, as shown below. Where cells were treated with cytokines or
hypoxia (0.1% oxygen), this was for 6 hr. Lipopolysaccharide (LPS) (from E. coli 026:B6; Sigma),
gamma Interferon (IFN) and Interleukin-10 (IL-10) were all used at a final concentration of 100 ng/ml.
The effect of gamma Interferon and Lipopolysaccharide is to activate macrophages, with a Th1 biased
phenotype, as found in many inflammatory conditions. Interleukin-10 is a Th2 cytokine and de-activates
macrophages, and suppresses their effector functions.

Experimental
Condition     Cytokines       Oxygen

1.            No cytokines    Normoxia
2.            No cytokines    Hypoxia
3.            IL-10           Normoxia
4.            IL-10           Hypoxia
5.            LPS+IFN         Normoxia
6.            LPS+IFN         Hypoxia



In Table 15, genes are shown which respond to LPS+IFN in normoxia by producing at least a 2-fold 
increase in expression, indicating probable pro-inflammatory functions. From this dataset various patterns 
of hypoxia regulation will be appreciated on top of the effect of LPS+IFN. 

For instance, the gene SCYA8 (p1I21; SeqID:479/480) is decreased in expression by hypoxia, changing
from 0.54 to 0.18 between conditions #1 and #2. In condition #5 (LPS+IFN normoxia), expression is
dramatically increased to a value of 19.6. When LPS+IFN is combined with hypoxia, this increase is
dampened to a value of 12.2. So for this example, hypoxia and cell activation have opposing effects
on gene expression. A similar expression profile is found for several other genes in Table 15.

In contrast, the gene P8 protein-candidate of metastasis 1 (p1F17; SeqID:329/330) is increased in
expression by hypoxia, changing from 0.26 to 1.78 between conditions #1 and #2. In condition #5
(LPS+IFN normoxia) expression is increased from condition #1 to a value of 1.16. In condition #6
(LPS+IFN hypoxia) the expression is further increased to a value of 2.59. So for this example, hypoxia
and cell activation have similar effects on expression (i.e. increases) and these are found to be synergistic.
A similar expression profile is found for several other genes in Table 15, including for Semaphorin 4b
(p1P14; SeqID:91/92/92a), which has been independently verified by RNase protection assay (see Figure
57).
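The two worked examples can be restated as fold changes relative to condition #1. The sketch below uses only the four expression values quoted in the text for each gene; the function name and the "opposing"/"same direction" labels are illustrative choices, not terms defined by the specification.

```python
def effect_summary(c1: float, c2: float, c5: float, c6: float) -> dict:
    """Summarise hypoxia and LPS+IFN effects for one gene.

    c1: no cytokines, normoxia     c2: no cytokines, hypoxia
    c5: LPS+IFN, normoxia          c6: LPS+IFN, hypoxia
    All fold changes are expressed relative to condition #1.
    """
    hypoxia_fold = c2 / c1
    activation_fold = c5 / c1
    combined_fold = c6 / c1
    same_direction = (hypoxia_fold > 1.0) == (activation_fold > 1.0)
    return {
        "hypoxia_fold": hypoxia_fold,
        "activation_fold": activation_fold,
        "combined_fold": combined_fold,
        "relation": "same direction" if same_direction else "opposing",
    }


# Values quoted in the text for SCYA8 and P8.
scya8 = effect_summary(0.54, 0.18, 19.6, 12.2)
p8 = effect_summary(0.26, 1.78, 1.16, 2.59)
print("SCYA8:", scya8["relation"])
print("P8:", p8["relation"])
```

For SCYA8 the combined value (12.2) is lower than LPS+IFN alone (19.6), consistent with hypoxia dampening the activation response; for P8 the combined value (2.59) exceeds either single stimulus.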

A selection of novel genes taken from Table 15 is also presented as Figure 53. These novel genes are
hence annotated here for the first time as being regulated not only by hypoxia, but also by Th1
inflammatory signals, as provided by LPS+IFN.

It will be appreciated that certain IMAGE clones were classed as novel and unannotated when the original
patent filing was made (8 Dec 2000), but can now be assigned to named genes. These are Uridine
5' monophosphate hydrolase 1 (clone p1I7; SeqID:49/50) and Insulin induced protein 2 (clone p1D10;
SeqID:75/76).

In Table 16, genes are shown which respond to LPS+IFN in normoxia by producing at least a 2-fold 
decrease in expression. From this dataset, various patterns of hypoxia regulation will be appreciated on 
top of the effect of LPS+IFN. 

In Figure 54, novel genes from Table 16 which are down-regulated by LPS+IFN and up-regulated by
hypoxia are presented. For most of these, the combined effect of LPS+IFN and hypoxia produces only a
minor induction above the level of expression for activated normoxic cells (for example p1F8/ SeqID:10/
Hypothetical Protein KIAA0914). In other cases, hypoxia is able to over-ride the
inhibitory effect of LPS+IFN on gene expression (for example p1D12/ SeqID:30/ Hypothetical Protein
KIAA1376). This clearly demonstrates the finding that different cell types, or physiological states of a cell
type (as here), respond to hypoxia differently.

In Figure 55, novel genes from Table 16 which are down-regulated both by LPS+IFN and by hypoxia are 
presented. In many of the genes presented here, these stimuli are synergistic, with minimal expression 
obtained with a combination of LPS+IFN and hypoxia. 

In Figure 56, a selection of named genes from Table 16 which are down-regulated by LPS+IFN, with
various responses to hypoxia, are presented. For the gene Max-interacting Protein 1, two separate clones
corresponding to this gene were available on the array (p1G5 from SeqID:280 and p1D22 from
SeqID:120). In the original specification, the IMAGE clone corresponding to SeqID:120 (accession
AA401496) was classified as an EST, and the IMAGE clone corresponding to SeqID:280 (accession
AA401496) was classified as "Max-interacting Protein 1", as determined by the UniGene database at that
time. Now it is apparent that both of these clones correspond to Max-interacting Protein 1, explaining the
similarity of their expression profiles in Figure 56. Clearly the response of this gene to hypoxia is
inhibited by LPS+IFN.

The additional data showing effects of the Th1 activation stimulus LPS+IFN extend the finding of these
genes as novel hypoxia regulated genes, and provide additional information about the relevance of these
genes to disease mechanisms.

It will be appreciated that certain IMAGE clones were classed as novel and unannotated when the original
patent filing was made (8 Dec 2000), but can now be assigned to named genes. These are TRIP-Br2
(clone p1D15; SeqID:31/32) and MAX-interacting protein 1 (clone p1D22; SeqID:119/120).

In Tables 15 and 16 and Figures 53-56, showing genes which respond to LPS+IFN, it will be noticed that
some of these genes also respond to the inhibitory cytokine IL-10 (e.g. Semaphorin 4b, Hypothetical
protein CGI-117). Other genes respond only to IL-10, but not to LPS+IFN. Specific responses to IL-10
are significant because this cytokine has been shown to have utility in suppressing inflammatory reactions
(Huizinga TW et al., Rheumatology 2000, 39: 1180-8).

Table 17 shows genes responsive to IL-10 (increased or decreased) but not affected significantly by 
LPS+IFN. Various patterns of hypoxia regulation will be appreciated. 

Example 5: Gene expression in human tumors 

One of the utilities of the genes identified herein relates to the diagnosis and treatment of human tumors,
on the basis that hypoxia is frequently found in tumors.

A study has been performed to examine the expression of these genes in a selection of breast and ovary
tumors, comparing expression with normal adjacent tissue from the same patient. There is expected to be
a large degree of variation between different patients, and the study here contains only 5 patients with a
range of diagnoses. Therefore, although certain genes will be identified from this data, other genes in the
current specification not flagged by this study are nevertheless likely to have utility in cancer.

Patients are designated by letters:

E: 50 year old Caucasian female. Diagnosis: ovarian adenocarcinoma. Normal ovarian tissue derived
from an age-matched separate individual.

F: 60 year old female. Diagnosis: poorly differentiated adenocarcinoma. Normal ovarian tissue derived
from the same individual.

G: 41 year old female. Diagnosis: moderately-differentiated adenocarcinoma. Normal ovarian tissue
derived from the same individual.

H: 40 year old female. Diagnosis: invasive ductal carcinoma. Normal breast tissue derived from the same
individual.

K: 58 year old female. Diagnosis: invasive ductal carcinoma. Normal breast tissue derived from the same
individual.

Data normalisation was done per-chip to correct for differences in labelling and hybridisation efficiency.
Per-gene normalisation was done such that the expression values of each gene are relative to the median
value of that gene throughout the series of samples. By comparing the expression values under normal
(nor) and tumor (tum) for a single patient, differences in expression between the normal and malignant
tissue of that patient can be inferred.

Table 18 lists genes which are up-regulated at least 3-fold in at least one patient, comparing the tumor
tissue to the adjacent normal tissue.

Table 19 lists genes which are down-regulated at least 3-fold in at least one patient, comparing the
tumor tissue to the adjacent normal tissue.
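The selection rule for Tables 18 and 19 can be expressed as a simple filter. The following is a sketch only: the function name and the example expression values are hypothetical, and the only rule taken from the text is the 3-fold tumor/normal threshold applied per patient, with a gene qualifying if any one patient meets it.

```python
def tumor_table_calls(expr: dict, threshold: float = 3.0) -> set:
    """Decide which table(s) a gene would qualify for.

    `expr` maps a patient letter to (normal, tumor) normalised expression.
    Returns a set that may contain "up" (Table 18 criterion) and/or
    "down" (Table 19 criterion); a gene can meet both in different patients.
    """
    calls = set()
    for normal, tumor in expr.values():
        if normal <= 0 or tumor <= 0:
            continue  # a ratio is undefined for non-positive values
        ratio = tumor / normal
        if ratio >= threshold:
            calls.add("up")
        if ratio <= 1.0 / threshold:
            calls.add("down")
    return calls


# Hypothetical values for one gene in three of the five patients.
gene_expr = {"E": (1.0, 3.5), "F": (1.0, 1.2), "H": (2.0, 0.5)}
print(sorted(tumor_table_calls(gene_expr)))
```

With only five patients, a single qualifying patient is a permissive criterion, which matches the cautious interpretation given in the text.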

Example 6: Effects of inflammatory cytokines on hypoxia-regulated genes 

Tumor necrosis factor alpha (TNFα) is a key pro-inflammatory cytokine both produced by and acting on
the macrophage. The significance of TNFα to human disease is well established in the art.
This is particularly the case in rheumatoid arthritis (RA), and neutralising antibodies to TNFα have been
reported to offer clinical utility. Because hypoxia is another pathological condition
exerted on macrophages in the synovia of RA patients, synergistic effects of these
two stimuli are highly relevant to the discovery of novel inflammatory targets
expressed by the macrophage. To investigate this, primary human macrophages
were exposed to either hypoxia (0.1% oxygen) or 100 ng/ml TNFα or to both for 6 hr. The
data shown below provide further credence to the utility of the encoded proteins as inflammatory targets
in macrophages and apply to any disease where hypoxia and TNFα are co-incident.

Gene expression levels were measured and compared using the custom gene array. In data analysis,
per-gene normalisation was set up such that expression values represent the fold-change compared with the
expression in untreated normoxic cells. Genes which are increased in expression in response to TNFα by
at least 2-fold, in either normoxic or hypoxic cells, are shown in Table 20. Genes which are decreased in
expression in response to TNFα by at least 2-fold, in either normoxic or hypoxic cells, are shown in Table
21.

Another inflammatory cytokine implicated in diseases where hypoxia is frequently
found is Interleukin-17 (IL-17). For example, this cytokine has been shown to
mediate inflammation and joint destruction in arthritis (Lubberts et al J Immunol
2001 167:1004-1013). IL-17 has also been shown to stimulate macrophages to
release other key pro-inflammatory cytokines (Jovanovic et al J Immunol 1998
160:3513-21). Therefore genes which respond to both hypoxia and IL-17 are
especially likely to be relevant to disease processes and have utility in the design of
therapeutic products. Genes which are increased in expression in response to IL-17 by at least
2-fold, in either normoxic or hypoxic cells, are shown in Table 22. Genes which are decreased in expression
in response to IL-17 by at least 2-fold, in either normoxic or hypoxic cells, are shown in Table 23.

The cytokine IL-15 is implicated in several diseases in which macrophages and hypoxia both feature as
elements of the inflammatory state, such as in atherosclerosis (Wuttge DM et al Am J Pathol 2001
159:417-23) and rheumatoid arthritis (McInnes IB et al Immunol Today. 1998 19:75-9). Although the
main target of IL-15 is T-cells, effects have also been shown on monocytes (Badolato R et al Blood. 1997
90:2804-9). Therefore genes which respond to both hypoxia and IL-15 are especially
likely to be relevant to disease processes and have utility in the design of
therapeutic products. Genes which are increased in expression in response to IL-15 by at least
2-fold, in either normoxic or hypoxic cells, are shown in Table 24. Genes which are decreased in expression
in response to IL-15 by at least 2-fold, in either normoxic or hypoxic cells, are shown in Table 25.
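The criterion shared by Tables 20 to 25 can be sketched as follows. Expression values are taken to be fold-changes relative to untreated normoxic cells, as stated for the TNFα analysis, so the untreated normoxic value is 1.0 by construction. The function name is hypothetical, and it is an assumption of this sketch that the response "in hypoxic cells" is judged by comparing cytokine plus hypoxia against hypoxia alone.

```python
def cytokine_response(hypoxia: float, cytokine: float, cytokine_hypoxia: float,
                      threshold: float = 2.0) -> set:
    """Classify one gene's response to a cytokine (e.g. TNFa, IL-17, IL-15).

    All inputs are fold-changes relative to untreated normoxic cells (= 1.0).
    Returns a set containing "increased" and/or "decreased" if the cytokine
    changes expression at least `threshold`-fold in either oxygen condition.
    """
    if min(hypoxia, cytokine, cytokine_hypoxia) <= 0:
        raise ValueError("fold-change values must be positive")
    folds = (
        cytokine / 1.0,              # cytokine effect in normoxic cells
        cytokine_hypoxia / hypoxia,  # cytokine effect in hypoxic cells (assumed baseline)
    )
    calls = set()
    if any(f >= threshold for f in folds):
        calls.add("increased")
    if any(f <= 1.0 / threshold for f in folds):
        calls.add("decreased")
    return calls


# Hypothetical genes, for illustration only.
print(cytokine_response(hypoxia=3.0, cytokine=1.2, cytokine_hypoxia=7.5))
print(cytokine_response(hypoxia=2.0, cytokine=0.4, cytokine_hypoxia=1.8))
```

In the first hypothetical gene the cytokine has little effect in normoxia but a 2.5-fold effect on top of hypoxia, which is the kind of co-incident response the text highlights as disease-relevant.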

Example 7: Rat foetal cardiomyocytes

Primary rat foetal cardiomyocytes provide an attractive experimental model for studying the responses of
cardiac cells to ischaemia. Cells are obtained which are non-immortalised and which are seen to contract
or beat in culture. It is of interest to examine how the responses of these cells to hypoxia (or related
experimental conditions) compare and contrast with those of other cell types. These other cell types might include
those that are similarly sensitive to the effects of hypoxia (such as neurones) or might be cells that show a
higher tolerance to hypoxia (such as macrophages). Experiments are performed in parallel for
cardiomyocytes and other cell type(s). The responses of these specific cell types are then determined by
hybridising labelled mRNA to microarrays. Alternative methods will include the construction of
subtracted cDNA libraries for the individual treated cell types and assessing which genes are contained
therein by sequencing.

Methods 

Cardiomyocytes are harvested from heart ventricles of embryos aged E18 days, using a cell isolation kit
(Neonatal cardiomyocyte isolation system; Worthington Biochemical Corporation, Lakewood, New
Jersey, 08701). They are seeded at 5x10^6 cells/100cm diameter petri dish in DMEM/M199, 10% horse
serum, 5% FCS, 1% penicillin, streptomycin, glutamine for 5 days at 37°C. Media is changed during the 5
days.

Other cell types used for comparison with cardiomyocytes are cultured according to their optimum
conditions and/or the standard routine. These cell types may include cardiomyocytes in a different
physiological setting, such as in an intact beating heart, or a different developmental state of the
cardiomyocyte, such as cardiomyoblast.

Identical seeded petri dishes are placed either in a standard tissue culture incubator (95% air / 5% CO2) or
in a hypoxia incubator (0.1% oxygen / 5% CO2) for 6 hours. This is done separately for
both cardiomyocytes and the other cell type(s) to be compared. Other experimental conditions might more
closely approximate ischaemia, by incorporating components additional to hypoxia.

At the end of the exposure to hypoxia, cells are placed on a chilled platform, washed in cold PBS and
total RNA is extracted using RNazol B (Tel-Test, Inc; distributed by Biogenesis Ltd) following the
manufacturer's instructions. Where appropriate, polyadenylated mRNA is extracted from the total RNA
using a commercial kit following the manufacturer's instructions (Promega; PolyATract mRNA isolation
System IV).

Array hybridisations and construction/analysis of subtracted cDNA libraries are performed according to 
standard methods or as described elsewhere in this specification. 

Example 8: Comparison of the hypoxic-responses between populations of rat primary cultured
neurons by a subtraction cloning / array screening approach.

Different regions of the central nervous system display different sensitivities to hypoxia and to ischaemia.
Susceptibility to tissue damage in this manner may occur as a result of intrinsic differences in gene
expression between cells. To evaluate this hypothesis, primary cultures of rat neurons from different
regions of the brain are established. Cultures are exposed to various experimental conditions which are
pertinent to pathologies of the hypoxic/ischaemic brain. These would include hypoxic insults as have been
described, or hypoxia/ischaemia where the conditions more closely approximate pathological ischaemia.
Either condition may be preceded by hypoxic-preconditioning, where transient exposure to hypoxia
renders cells less sensitive to subsequent acute treatment. For all possible experimental treatments, a
similar routine is performed for distinct neuron subtypes, in order to compare their responses. Such
comparisons may be made by hybridizing labelled mRNA to microarrays or derivatives thereof.
Alternatively, subtracted libraries might be constructed individually for each treated neuron subtype, and
clones which are confirmed to be changed in expression are sequenced. The collections of genes arising
from the different neuron subtypes will be compared.

Methods

Primary cultures are established according to standard procedures from embryonic rats aged from E14 to
E18 (Dunnett SB, Bjorkland A (Eds.) 1992, Neural Transplantation, A Practical Approach. IRL Press).
Isolated neurons include but are not limited to those from ventral mesencephalon, striatum, hippocampus,
cerebellum, cerebral cortex, dorsal root ganglia and superior cervical ganglia.

Cells are maintained in culture for 3-14 days in humidified culture incubators at 37°C, 5% CO2, 95% air
(Normoxia) in Neurobasal Medium (Brewer GJ, 1995, Journal of Neuroscience Research 42:674-83)
supplemented with B27 (both Life Technologies). For the hypoxia-preconditioning, cells are transferred
to a second incubator at 37°C, 5% CO2, 94.9% Nitrogen, 0.1% Oxygen (Hypoxia) for 30-180 minutes
and returned to the normoxic incubator for 24 hours (Pringle et al., 1997, Neuropathology and Applied
Neurobiology 23:289-298). For the hypoxic stimulus, either independent from or subsequent to
hypoxia-preconditioning, cells are transferred to the hypoxic incubator for 2-6 hours as determined in time course
experiments. Additionally, as appropriate, the medium in which the cells are grown is replaced with
glucose-free media for establishment of experimental ischaemia (Ray AM, Owen DE, Evans ML, Davis
JB, Benham, 2000. Caspase inhibitors are functionally neuroprotective against oxygen glucose
deprivation induced CA1 death in rat organotypic hippocampal slices). At the end of the exposure to
hypoxia (or hypoxia/ischaemia), cells are placed on a chilled platform, washed in cold PBS and total
RNA is extracted using RNazol B (Tel-Test, Inc; distributed by Biogenesis Ltd) following the
manufacturer's instructions. Where appropriate, polyadenylated mRNA is extracted from the total RNA
using a commercial kit following the manufacturer's instructions (Promega; PolyATract mRNA isolation
System IV).

Array hybridisations and construction/analysis of subtracted cDNA libraries are performed according to 
standard methods or as described elsewhere in this specification. 

Example 9: Semaphorin 4b 

We have screened cDNA libraries derived from the human brain and leukocytes, to obtain an unequivocal
and accurate full length cDNA sequence (SEQ ID No 92a) and the accurate presumptive amino acid
sequence (SEQ ID No 91).

The amino acid sequence above was derived by taking the first ATG. We have various independent lines
of evidence that this is the bona fide translation initiation codon.

Basic analysis of this sequence reveals the following motifs:
signal peptide (pSORT) Start: 1 End: 37;
Transmembrane (pSORT) Start: 718 End: 734;
cleavage site (pSORT) Start: 38 End: 38;
Proline rich region Start: 758 End: 824;
Sema domain (pfam) Start: 70 End: 503;
Plexin repeat (pfam) Start: 525 End: 548;
integrin, beta domain (pfam) Start: 532 End: 546;
cytoplasmic tail Start: 735 End: 837.
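The motif coordinates listed above can be checked for internal consistency with a short script. This is an illustrative sketch only: the coordinates are copied from the list, while the dictionary name, the helper functions and the assumption of a type I membrane topology (signal peptide, extracellular region, single transmembrane helix, C-terminal cytoplasmic tail) are introduced here for illustration.

```python
# Residue coordinates copied from the motif list (1-based, inclusive).
MOTIFS = {
    "signal peptide": (1, 37),
    "cleavage site": (38, 38),
    "Sema domain": (70, 503),
    "Plexin repeat": (525, 548),
    "integrin beta domain": (532, 546),
    "transmembrane": (718, 734),
    "cytoplasmic tail": (735, 837),
    "proline rich region": (758, 824),
}


def contains(outer: tuple, inner: tuple) -> bool:
    """True if the `inner` residue range lies entirely within `outer`."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


def topology(motifs: dict) -> dict:
    """Derive coarse regions, assuming a type I membrane protein."""
    signal = motifs["signal peptide"]
    tm = motifs["transmembrane"]
    return {
        "extracellular": (signal[1] + 1, tm[0] - 1),
        "transmembrane": tm,
        "cytoplasmic": motifs["cytoplasmic tail"],
    }


regions = topology(MOTIFS)
print("extracellular:", regions["extracellular"])
print("proline rich region in cytoplasmic tail:",
      contains(regions["cytoplasmic"], MOTIFS["proline rich region"]))
```

Under these assumptions the Sema domain and Plexin repeat fall in the extracellular region, and the proline rich region lies wholly within the cytoplasmic tail.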

To confirm the hypoxic regulation of Sema4b, we used RNase protection assay (see Figure 57). Hypoxia 
is a feature of several inflammatory conditions often accompanied by superoxide radicals and the immune 
regulator gamma interferon. In this experiment we have made the following findings: 

• Expression is activated by hypoxia (3.3 fold)

• Expression is activated by gamma interferon and LPS (3.9 fold)

• Expression is activated synergistically by hypoxia plus gamma interferon/LPS (7.3 fold)

• Expression is activated by superoxide radicals (5.0 fold)

To investigate the size of the mRNA and the tissue distribution, Northern blotting was done (see Figure
58). This shows that the gene is expressed as a single transcript at relatively low levels in unstimulated
human tissues.

We have also found that a molecule that is probably associated with Semaphorin 4B, called psd-95, is
another macrophage hypoxia-induced protein (see SEQ ID No 299). This is based on the fact that psd-95
binds the cytoplasmic tail of Sema4c (Inagaki et al, J Biol Chem. 2001; 276(12): 9174-81), which, like
Sema4b, contains a proline rich sequence. Therefore, both Semaphorin 4B and a probable partner are
co-ordinately regulated by hypoxia.

Example 10: Discussion of relevance of individual clones 

The Oxford BioMedica clone p1F12 represents Hypothetical protein FLJ13611. The protein sequence
encoded by Hypothetical protein FLJ13611 is represented in the public databases by the accession
NP_079217 and is described in this patent by Seq ID 1. The nucleotide sequence is represented in the
public sequence databases by the accession NM_024941 and is described in this patent by Seq ID 2.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products.

The Oxford BioMedica clone p1F2 represents Hypothetical protein FLJ20037. The protein sequence
encoded by Hypothetical protein FLJ20037 is represented in the public databases by the accession
CAB65981 and is described in this patent by Seq ID 3. The nucleotide sequence is represented in the
public sequence databases by the accession NM_017633 and is described in this patent by Seq ID 4.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In
a series of 5 patients with either ovarian or breast cancer, Hypothetical protein FLJ20037 is
down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient. We expect
increased activity of the gene product to have an anti-tumour effect.

The Oxford BioMedica clone p1F10 represents Hypothetical protein DKFZp434P0116. The protein
sequence encoded by Hypothetical protein DKFZp434P0116 is represented in the public databases by the
accession T46364 and is described in this patent by Seq ID 5. The nucleotide sequence is represented in
the public sequence databases by the accession NM_017593 and is described in this patent by Seq ID 6.
Hypothetical protein DKFZp434P0116 is predicted to be a kinase due to high structural similarity with
other known kinases. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products. Hypothetical protein DKFZp434P0116 is repressed in macrophages
activated by LPS and gamma interferon. We expect it to have an anti-inflammatory role.

The Oxford BioMedica clone p1F19 represents Hypothetical protein KIAA0212. The protein sequence
encoded by Hypothetical protein KIAA0212 is represented in the public databases by the accession
BAA13203 and is described in this patent by Seq ID 7. The nucleotide sequence is represented in the
public sequence databases by the accession NM_014674 and is described in this patent by Seq ID 8.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products.

The Oxford BioMedica clone p1F8 represents Hypothetical protein KIAA0914. The protein sequence
encoded by Hypothetical protein KIAA0914 is represented in the public databases by the accession
NP_055698 and is described in this patent by Seq ID 9. The nucleotide sequence is represented in the
public sequence databases by the accession NM_014883 and is described in this patent by Seq ID 10.
Hypothetical protein KIAA0914 shows high structural similarity to Human Class I alpha 1,2-Mannosidase
and conservation of active site and binding site residues; therefore we predict that
Hypothetical protein KIAA0914 will act as a mannosidase. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. Hypothetical protein
KIAA0914 is repressed in macrophages activated by LPS and gamma interferon. We expect the gene
product to have an anti-inflammatory role.

The Oxford BioMedica clone p1F5 represents Hypothetical protein FLJ20281. The protein sequence
encoded by Hypothetical protein FLJ20281 is represented in the public databases by the accession
XP_008736 and is described in this patent by Seq ID 11. The nucleotide sequence is represented in the
public sequence databases by the accession NM_017742 and is described in this patent by Seq ID 12.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been shown to be
central to the pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change
in expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and
diagnostic products for such inflammatory conditions. Hypothetical protein FLJ20281 is induced in
macrophages activated by TNFalpha.

The Oxford BioMedica clone p1F18 represents Hypothetical protein KIAA0876. The protein sequence
encoded by Hypothetical protein KIAA0876 is represented in the public databases by the accession
BAA74899 and is described in this patent by Seq ID 13. The nucleotide sequence is represented in the 
public sequence databases by the accession XM_035625 and is described in this patent by Seq ID 14. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products.

The Oxford BioMedica clone p1F7 represents Spectrin, beta, non-erythrocytic 1. The protein sequence
encoded by Spectrin, beta, non-erythrocytic 1 is represented in the public databases by the accession 
NP_003119 and is described in this patent by Seq ID 15. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_003128 and is described in this patent by Seq ID 16.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In 
a series of 5 patients with either ovarian or breast cancer, Spectrin, beta, non-erythrocytic 1 is
down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient. We expect
increased activity of the gene product to have an anti-tumour effect.

The Oxford BioMedica clone p1F21 represents Hematopoietic Zinc finger protein. The protein sequence
encoded by Hematopoietic Zinc finger protein is represented in the public databases by the accession 
AAL08625 and is described in this patent by Seq ID 17. The nucleotide sequence is represented in the
public sequence databases by the accession AK024404 and is described in this patent by Seq ID 18.
Hematopoietic Zinc finger protein is a transcriptional regulator that contains a Cys2-His2 zinc finger
motif. It is predicted to bind to metal response elements (MRE) and therefore activate the transcription of
genes that contain an MRE sequence within their promoter region, such as metallothioneins. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. There is
a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary to this, we show 
that genes are regulated by hypoxia to a greater degree in certain cell types, substantiating their utility in 
designing specific therapeutic products for diseases involving those cell types. Monocytes and
macrophages have been implicated in the following diseases involving hypoxia: rheumatoid arthritis,
atherosclerosis, cancer and COPD. Hematopoietic Zinc finger protein is preferentially induced by hypoxia in
monocytes or macrophages and a restricted number of other cell types. It is therefore a candidate for 
specific intervention for treatment or diagnosis of the above diseases. 

The Oxford BioMedica clone p1F9 represents Hypothetical protein KIAA0742. The protein sequence
encoded by Hypothetical protein KIAA0742 is represented in the public databases by the accession
NP_060903 and is described in this patent by Seq ID 19. The nucleotide sequence is represented in the 
public sequence databases by the accession AB018285 and is described in this patent by Seq ID 20. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Hypothetical protein KIAA0742 is repressed in macrophages activated by LPS and gamma
interferon. We expect it to have an anti-inflammatory role. Hypothetical protein KIAA0742 shows
significant homology to the transcription factor hairless. We therefore propose that Hypothetical protein
KIAA0742 may play a crucial role in the regulation of hair growth. Accordingly, this aspect of the
invention includes the use of this protein, fragments and functional equivalents of this protein, encoding 
nucleic acid molecules, in addition to ligands that bind specifically to this protein, in the diagnosis and 
treatment of hair loss. 

The Oxford BioMedica clone p1E13 represents Hypothetical protein PRO0823. The protein sequence
encoded by Hypothetical protein PRO0823 is represented in the public databases by the accession 
AAF71073 and is described in this patent by Seq ID 21. The nucleotide sequence is represented in the 
public sequence databases by the accession AF116653 and is described in this patent by Seq ID 22.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Hypothetical protein PRO0823 is repressed in macrophages activated by LPS and gamma
interferon. We expect it to have an anti-inflammatory role. Hypoxia is frequently found in human tumours 
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 
Hypothetical protein PRO0823 is down-regulated in the malignant tissue as compared to adjacent normal 
tissue in at least one patient. We expect increased activity of the gene product to have an anti-tumour
effect.

The Oxford BioMedica clones p1D1 and p1D2 represent the Hypothetical protein FLJ10134. The protein
sequence encoded by Hypothetical protein FLJ10134 is represented in the public databases by the 
accession NP_060474 and is described in this patent by Seq ID 23. The nucleotide sequence is 
represented in the public sequence databases by the accession NM_018004 and is described in this patent
by Seq ID 24. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and
diagnostic products. HIF1alpha and EPAS1 are transcription factors that mediate the response to hypoxia
of several genes, and have themselves been implicated in specific diseases. By adenoviral
over-expression of EPAS1 we show augmentation of the hypoxic induction of certain genes, further
confirming their status as responsive to hypoxia. Hypothetical protein FLJ10134 has been shown to be
induced by hypoxia to a greater degree following adenoviral over-expression of EPAS1. Macrophages are 
key to several diseases involving hypoxia, and contribute to inflammatory processes. In these, 
macrophages are frequently activated by cytokines, which have been shown to be present at disease sites, 
so gene expression responses to both hypoxia and cytokines are especially relevant. Hypothetical protein
FLJ10134 is repressed in macrophages activated by LPS and gamma interferon. We expect it to have an
anti-inflammatory role. Hypoxia is frequently found in human tumours where macrophage infiltrates are 
also found. In a series of 5 patients with either ovarian or breast cancer, Hypothetical protein FLJ10134 is 
down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient. We 
expect increased activity of the gene product to have an anti-tumour effect. 

The Oxford BioMedica clone p1D4 represents Hypothetical protein FLJ20500. The protein sequence
encoded by Hypothetical protein FLJ20500 is represented in the public databases by the accession 
NP_061931 and is described in this patent by Seq ID 25. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_019058 and is described in this patent by Seq ID 26. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. Hypothetical protein FLJ20500 is preferentially induced by hypoxia in mammary epithelial
cells. 

The Oxford BioMedica clone p1D9 represents Hypothetical protein DKFZP564D116. The protein
sequence encoded by Hypothetical protein DKFZP564D116 is represented in the public databases by the
accession T08708 and is described in this patent by Seq ID 27. The nucleotide sequence is represented in 
the public sequence databases by the accession AL050022 and is described in this patent by Seq ID 28. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Hypothetical protein DKFZP564D116 is repressed in macrophages activated by LPS and 
gamma interferon. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been
shown to be central to the pathophysiology and treatment of diseases including rheumatoid arthritis.
Genes that change in expression in response to TNFalpha therefore have utility in the design of 
therapeutic, prognostic and diagnostic products for such inflammatory conditions. Hypothetical protein 
DKFZP564D116 is induced in macrophages activated by TNFalpha. 

The Oxford BioMedica clone p1D12 represents Hypothetical protein KIAA1376. The protein sequence
encoded by Hypothetical protein KIAA1376 is represented in the public databases by the accession
BAA92614 and is described in this patent by Seq ID 29. The nucleotide sequence is represented in the 
public sequence databases by the accession AB037797 and is described in this patent by Seq ID 30. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Hypothetical protein KIAA1376 is repressed in macrophages activated by LPS and gamma 
interferon. We expect it to have an anti-inflammatory role. 

The Oxford BioMedica clone p1D15 represents TRIP-Br2. The protein sequence encoded by TRIP-Br2 is
represented in the public databases by the accession NP_055570 and is described in this patent by Seq ID
31. The nucleotide sequence is represented in the public sequence databases by the accession
NM_014755 and is described in this patent by Seq ID 32. TRIP-Br2 is a PHD zinc finger and
bromodomain interacting protein transcriptional regulator and is involved in the regulation of cell cycle
progression. Its hypoxia-regulation is likely to have important disease-relevant effects. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. The 
response of renal epithelial cells to hypoxia is pertinent to kidney failure, especially regarding the 
medullary tissue. TRIP-Br2 is preferentially induced by hypoxia in renal epithelial cells. Macrophages are
key to several diseases involving hypoxia, and contribute to inflammatory processes. In these, 
macrophages are frequently activated by cytokines, which have been shown to be present at disease sites, 
so gene expression responses to both hypoxia and cytokines are especially relevant. TRIP-Br2 is 
repressed in macrophages activated by LPS and gamma interferon. 

The Oxford BioMedica clone p1D16 represents Hypothetical protein FLJ20308. The protein sequence
encoded by Hypothetical protein FLJ20308 is represented in the public databases by the accession 
XP_039852 and is described in this patent by Seq ID 33. The nucleotide sequence is represented in the 
public sequence databases by the accession AK000315 and is described in this patent by Seq ID 34. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Hypothetical protein FLJ20308 is repressed in macrophages activated by LPS and gamma
interferon. We expect it to have an anti-inflammatory role.

The Oxford BioMedica clone p1J13 represents Hypothetical nuclear factor SBBI22. The protein sequence
encoded by Hypothetical nuclear factor SBBI22 is represented in the public databases by the accession 
NP_065128 and is described in this patent by Seq ID 35. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_020395 and is described in this patent by Seq ID 36. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone p1I22 represents Hypothetical protein KIAA1429. The protein sequence
encoded by Hypothetical protein KIAA1429 is represented in the public databases by the accession 
BAA92667 and is described in this patent by Seq ID 37. The nucleotide sequence is represented in the
public sequence databases by the accession AB037850 and is described in this patent by Seq ID 38. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products.

The Oxford BioMedica clone p1J6 represents Hypothetical protein FLJ10206. The protein sequence
encoded by Hypothetical protein FLJ10206 is represented in the public databases by the accession 
AAH06108 and is described in this patent by Seq ID 39. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_018025 and is described in this patent by Seq ID 40.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Hypothetical protein FLJ10206 is repressed in macrophages activated by IL-17 and is also
repressed in macrophages activated by IL-15. These are pro-inflammatory cytokines, and we expect the
hypothetical protein FLJ10206 to have an anti-inflammatory role.

The Oxford BioMedica clone p1I5 represents Hypothetical protein FLJ10815. The protein sequence
encoded by Hypothetical protein FLJ10815 is represented in the public databases by the accession
BAA91830 and is described in this patent by Seq ID 41. The nucleotide sequence is represented in the
public sequence databases by the accession NM_018231 and is described in this patent by Seq ID 42.
Hypothetical protein FLJ10815 is structurally similar to an alpha / beta barrel structure. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products.
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Hypothetical 
protein FLJ10815 is repressed in macrophages activated by LPS and gamma interferon. We expect it to 
have an anti-inflammatory role. 

The Oxford BioMedica clone p1I13 represents Hypothetical protein FLJ11100. The protein sequence
encoded by Hypothetical protein FLJ11100 is represented in the public databases by the accession 
NP_060701 and is described in this patent by Seq ID 43. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_018321 and is described in this patent by Seq ID 44.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. 

The Oxford BioMedica clone p1I17 represents Hypothetical protein FLJ20644. The protein sequence
encoded by Hypothetical protein FLJ20644 is represented in the public databases by the accession 
NP_060387 and is described in this patent by Seq ID 45. Hypothetical protein FLJ20644 is a putative
serine/threonine phosphatase. Region 250-450 shows high structural similarity to other serine/threonine
phosphatases. The nucleotide sequence is represented in the public sequence databases by the accession
NM_017917 and is described in this patent by Seq ID 46. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clone p1I15 represents Hypothetical protein CGI-117. The protein sequence
encoded by Hypothetical protein CGI-117 is represented in the public databases by the accession
Q9Y3C1 and is described in this patent by Seq ID 47. The nucleotide sequence is represented in the
public sequence databases by the accession NM_016391 and is described in this patent by Seq ID 48.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. HIF1alpha and EPAS1 are transcription factors that mediate the response to hypoxia of several
genes, and have themselves been implicated in specific diseases. By adenoviral over-expression of
HIF1alpha or EPAS1 we show augmentation of the hypoxic induction of certain genes, further
confirming their status as responsive to hypoxia. Hypothetical protein CGI-117 has been shown to be
induced by hypoxia to a greater degree following adenoviral over-expression of either HIF1alpha or
EPAS1. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially
relevant. Hypothetical protein CGI-117 is repressed in macrophages activated by LPS and gamma
interferon. We expect it to have an anti-inflammatory role.

The Oxford BioMedica clone p1I7 represents Uridine 5' monophosphate hydrolase 1. The protein
sequence encoded by Uridine 5' monophosphate hydrolase 1 is represented in the public databases by the
accession NP_057573 and is described in this patent by Seq ID 49. The nucleotide sequence is
represented in the public sequence databases by the accession NM_016489 and is described in this patent
by Seq ID 50. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. Macrophages are key to several diseases involving hypoxia, and contribute to 
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been
shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are
especially relevant. Uridine 5' monophosphate hydrolase 1 is induced in macrophages activated by LPS
and gamma interferon and is also induced in macrophages activated by IL-15. We expect it to have a
pro-inflammatory role, and its inhibition may have an anti-inflammatory effect.

The protein sequence encoded by Hypothetical protein KIAA0014 is represented in the public databases
by the accession NP_055480 and is described in this patent by Seq ID 51. The nucleotide sequence is 
represented in the public sequence databases by the accession NM_014665 and is described in this patent 
by Seq ID 52. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and
diagnostic products. 

The Oxford BioMedica clone p1I4 represents Hypothetical protein HSPC196. The protein sequence
encoded by Hypothetical protein HSPC196 is represented in the public databases by the accession
NP_057548 and is described in this patent by Seq ID 53. The nucleotide sequence is represented in the
public sequence databases by the accession NM_016464 and is described in this patent by Seq ID 54.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially
relevant. Hypothetical protein HSPC196 is repressed in macrophages activated by LPS and gamma 
interferon. We expect it to have an anti-inflammatory role. 

The Oxford BioMedica clone p1I8 represents Hypothetical protein FLJ11296. The protein sequence
encoded by Hypothetical protein FLJ11296 is represented in the public databases by the accession
XP_004747 and is described in this patent by Seq ID 55. The nucleotide sequence is represented in the
public sequence databases by the accession NM_018384 and is described in this patent by Seq ID 56. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone p1I16 represents Hypothetical protein KIAA1668. The protein sequence
encoded by Hypothetical protein KIAA1668 is represented in the public databases by the accession
BAB33338 and is described in this patent by Seq ID 57. The nucleotide sequence is represented in the
public sequence databases by the accession AB051455 and is described in this patent by Seq ID 58. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. 

The Oxford BioMedica clone p1I11 represents SECIS binding protein 2. The protein sequence encoded
by SECIS binding protein 2 is represented in the public databases by the accession AAK57518 and is
described in this patent by Seq ID 59. The nucleotide sequence is represented in the public sequence
databases by the accession AF380995 and is described in this patent by Seq ID 60. SECIS binding protein
2 is a crucial component in the complex required for the translation of mammalian selenoprotein mRNAs. 
Selenoproteins are important responders to redox conditions and many selenoproteins are known to 
protect from cell death. Our demonstration of the hypoxia induction of SECIS binding protein 2 opens
new avenues for diagnosis and therapeutic intervention. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Hypoxia is frequently found in 
human tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian 
or breast cancer, SECIS binding protein 2 is down-regulated in the malignant tissue as compared to
adjacent normal tissue in at least one patient. We expect increased activity of the gene product to have an 
anti-tumour effect. 

The Oxford BioMedica clone p1E8 represents cDNA: FLJ22249 fis, clone HRC02674. The sequence
cDNA: FLJ22249 fis, clone HRC02674 is not represented in the public databases by a protein accession. 
The nucleotide sequence is represented in the public sequence databases by the accession AK025902 and
is described in this patent by Seq ID 62. Hypoxia is an important feature of several diseases, and genes
that respond to this stimulus are therefore implicated in the pathogenesis and have utility in the design of
therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clone p1E18 represents Plexin C1. The protein sequence encoded by Plexin C1 is
represented in the public databases by the accession NP_005752 and is described in this patent by Seq ID
63. The nucleotide sequence is represented in the public sequence databases by the accession
NM_005761 and is described in this patent by Seq ID 64. Plexins are a large family of receptors for
transmembrane, secreted, and GPI-anchored semaphorins in vertebrates and play a significant role in 
signal transduction [Tamagnone et al 1999, Cell 99:71-80]. Elsewhere in this patent we disclose hypoxic
regulation of a new semaphorin 4b, and we propose co-regulation of these molecules by hypoxia and their
relevance to inflammatory disease, its diagnosis and therapy. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. Plexin C1 is repressed in
macrophages activated by LPS and gamma interferon. 

The Oxford BioMedica clone p1E16 represents cDNA DKFZp586E1624. The sequence cDNA
DKFZp586E1624 is not represented in the public databases by a protein accession. The nucleotide
sequence is represented in the public sequence databases by the accession AL110152 and is described in
this patent by Seq ID 66. Hypoxia is an important feature of several diseases, and genes that respond to 
this stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. HIF1alpha and EPAS1 are transcription factors that mediate the
response to hypoxia of several genes, and have themselves been implicated in specific diseases. By
adenoviral over-expression of EPAS1 we show augmentation of the hypoxic induction of certain genes, 
further confirming their status as responsive to hypoxia. Its preferential regulation by EPAS1 provides a 
route to preferential intervention, to avoid toxicity to other tissues. The cDNA DKFZp586E1624 has been 
shown to be induced by hypoxia to a greater degree following adenoviral over-expression of EPAS1. 
Endothelial cells are key to angiogenesis, a process implicated in several diseases associated with
hypoxia, including cancer and rheumatoid arthritis. The cDNA DKFZp586E1624 is preferentially 
induced by hypoxia in endothelial cells. We expect this gene product to have a pro-angiogenic effect, and 
its inhibition to have an anti-angiogenic effect. 

The Oxford BioMedica clones p1D5 and p1D6 represent ERO1 (S. cerevisiae)-like. The protein sequence
encoded by ERO1 (S. cerevisiae)-like is represented in the public databases by the accession NP_055399
and is described in this patent by Seq ID 67. The nucleotide sequence is represented in the public
sequence databases by the accession NM_014584 and is described in this patent by Seq ID 68. ERO1 (S.
cerevisiae)-like has been shown to be a flavin adenine dinucleotide (FAD) binding protein. Binding of
FAD enables ERO1 (S. cerevisiae)-like to oxidise protein disulfide isomerase (PDI). We propose that the
oxidisation of PDI by ERO1 (S. cerevisiae)-like stops PDI autodegradation, therefore increasing levels of
the protein. Increased levels of PDI have been shown to be neuroprotective by inhibiting apoptotic cell
death. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are
therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and
diagnostic products. HIF1alpha and EPAS1 are transcription factors that mediate the response to hypoxia
of several genes, and have themselves been implicated in specific diseases. By adenoviral
over-expression of EPAS1 we show augmentation of the hypoxic induction of certain genes, further
confirming their status as responsive to hypoxia. ERO1 (S. cerevisiae)-like has been shown to be induced
by hypoxia to a greater degree following adenoviral over-expression of EPAS1. Its preferential regulation
by EPAS1 provides a route to preferential intervention, to avoid toxicity to other tissues. ERO1 (S.
cerevisiae)-like is preferentially induced by hypoxia in mammary epithelial cells. Macrophages are key to
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. ERO1 (S. cerevisiae)-like is
repressed in macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast
cancer, ERO1 (S. cerevisiae)-like is up-regulated in the malignant tissue as compared to adjacent normal
tissue in at least one patient.

The Oxford BioMedica clone p1E12 represents Hypothetical protein DKFZP434E1723. The protein
sequence encoded by Hypothetical protein DKFZP434E1723 is represented in the public databases by the
accession XP_05338 and is described in this patent by Seq ID 69. The nucleotide sequence is represented
in the public sequence databases by the accession BC010005 and is described in this patent by Seq ID 70.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products.

The Oxford BioMedica clone p1E10 represents cDNA FLJ11041 fis, clone PLACE1004405. The protein
sequence encoded by cDNA FLJ11041 fis, clone PLACE1004405 is not represented in the public
databases by a protein accession. The nucleotide sequence is represented in the public sequence databases
by the accession AK001903 and is described in this patent by Seq ID 72. Hypoxia is an important feature
of several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis
and have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. The cDNA FLJ11041 fis,
clone PLACE1004405 is induced in macrophages activated by LPS and gamma interferon. We expect it
to have a pro-inflammatory role, and its inhibition may have an anti-inflammatory effect.

The Oxford BioMedica clone p1C21 represents Tubulin, beta, 4. The protein sequence encoded by
Tubulin, beta, 4 is represented in the public databases by the accession NP_006077 and is described in
this patent by Seq ID 73. The nucleotide sequence is represented in the public sequence databases by the
accession NM_006086 and is described in this patent by Seq ID 74. Hypoxia is an important feature of
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and
have utility in the design of therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clone p1D10 represents Insulin induced protein 2. The protein sequence encoded
by Insulin induced protein 2 is represented in the public databases by the accession AAD43048 and is
described in this patent by Seq ID 75. The nucleotide sequence is represented in the public sequence
databases by the accession AF125392 and is described in this patent by Seq ID 76. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products.
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Insulin induced
protein 2 is induced in macrophages activated by LPS and gamma interferon. We expect it to have a
pro-inflammatory role, and its inhibition may have an anti-inflammatory effect.

The Oxford BioMedica clones p1D13 and p1A22 represent Adenylate kinase 3. The protein sequence
encoded by Adenylate kinase 3 is represented in the public databases by the accession NP_037542 and is
described in this patent by Seq ID 77 and 263. The nucleotide sequence is represented in the public
sequence databases by the accession NM_013410 and is described in this patent by Seq ID 78 and 264.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially
relevant. Adenylate kinase 3 is induced in macrophages activated by LPS and gamma interferon.
TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been shown to be central to
the pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change in
expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and
diagnostic products for such inflammatory conditions. Adenylate kinase 3 is induced in macrophages
activated by TNFalpha.

The Oxford BioMedica clone p1E9 represents a novel PI-3-kinase adapter. The protein sequence encoded
by the novel PI-3-kinase adapter is not represented in the public databases by a protein accession but is
described in this patent by Seq ID 79. The nucleotide sequence of an unannotated EST corresponding to
the novel PI-3-kinase adapter is represented in the public sequence databases by the accession R62339
and is described in this patent by Seq ID 80. Hypoxia is an important feature of several diseases, and
genes that respond to this stimulus are therefore implicated in the pathogenesis and have utility in the
design of therapeutic, prognostic and diagnostic products. There is a prejudice in the art that the response
to hypoxia is generic to all cell types. Contrary to this, we show that genes are regulated by hypoxia to a
greater degree in certain cell types, substantiating their utility in designing specific therapeutic products
for diseases involving those cell types. Monocytes and macrophages have been implicated in the
following diseases involving hypoxia: rheumatoid arthritis, atherosclerosis, cancer, COPD and peripheral
arterial disease. The novel PI-3-kinase adapter is preferentially induced by hypoxia in monocytes or
macrophages, indicating utility of the encoded protein in the design of therapeutic, prognostic and
diagnostic products addressing diseases involving macrophages and hypoxia. In a gene array analysis it is
expressed in hypoxic monocytes and macrophages at levels 6-fold higher than the median expression
level of this gene throughout 9 other cell types in either normoxia or hypoxia. In more sensitive TaqMan
analysis the novel PI-3-kinase adapter is found to be expressed at approximately 1000 times the levels
of 9 other cell types, all exposed to hypoxia for 18 hr. The relevance of the novel PI-3-kinase adapter to
human disease is also appreciated from comparison with a related murine gene, BCAP. It is known that
the product of this gene is phosphorylated by the tyrosine kinase Syk. We also show novel data regarding
Syk, in that it is also induced in response to hypoxia in a tissue-specific manner identical to that of the
novel PI-3-kinase adapter. Therefore the biological relevance and utility of our discovery of hypoxic
induction of the novel PI-3-kinase adapter gene is further highlighted.
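
The fold-over-median comparison described for the gene array analysis can be sketched as follows. This is
a minimal illustration only: the function name and every numeric value are hypothetical placeholders chosen
so that the arithmetic is easy to follow, and they are not measurements from this specification.

```python
from statistics import median


def fold_over_median(target_level, other_levels):
    """Return target_level divided by the median of other_levels."""
    baseline = median(other_levels)
    if baseline <= 0:
        raise ValueError("median of the comparison levels must be positive")
    return target_level / baseline


# Hypothetical placeholder expression levels (arbitrary units) for 9 comparison
# cell types; these are NOT data from the specification.
other_cell_types = [1.0, 0.8, 1.2, 0.9, 1.1, 1.0, 1.3, 0.7, 1.0]

# Hypothetical level in hypoxic monocytes/macrophages.
macrophage_hypoxia = 6.0

fold = fold_over_median(macrophage_hypoxia, other_cell_types)
print(f"fold over median: {fold:.1f}")
```

Using the median rather than the mean as the baseline keeps a single highly expressing cell type from
distorting the comparison.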

The Oxford BioMedica clone p1F1 represents an unannotated EST. The protein sequence encoded by this
EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession AA489477 and is described in this patent
by Seq ID 82. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and
diagnostic products.

The Oxford BioMedica clone p1E7 represents a novel Metallothionein. The protein sequence encoded by
the novel Metallothionein is not represented in the public databases by a protein accession but is
described in this patent by Seq ID 83. The nucleotide sequence is represented in the public sequence
databases by the accession R06601 and is described in this patent by Seq ID 84. Metallothioneins can act
as antioxidants and free-radical scavengers and are therefore protective against cell death in hypoxia.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. HIF1alpha and EPAS1 are transcription factors that mediate the response to hypoxia of several
genes, and have themselves been implicated in specific diseases. By adenoviral over-expression of
HIF1alpha we show augmentation of the hypoxic induction of certain genes, further confirming their
status as responsive to hypoxia. The novel Metallothionein represented by Seq ID 84 has been shown to be
induced by hypoxia to a greater degree following adenoviral over-expression of HIF1alpha. Hepatocytes
are the main cell type of the liver and genes that are induced in response to hypoxia in this cell type are
relevant to development of diagnostics and therapeutics towards liver diseases involving hypoxia,
including cirrhosis. The novel Metallothionein represented by Seq ID 84 is preferentially induced by
hypoxia in hepatocytes. Macrophages are key to several diseases involving hypoxia, and contribute to
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been
shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are
especially relevant. The novel Metallothionein represented by Seq ID 84 is induced in macrophages
activated by LPS and gamma interferon.

The Oxford BioMedica clone p1E6 represents EGL nine (C. elegans) homolog 3 (EGLN3). The protein
sequence encoded by EGL nine (C. elegans) homolog 3 is represented in the public databases by the
accession NP_071356 and is described in this patent by Seq ID 85. The nucleotide sequence is represented
in the public sequence databases by the accession NM_022073 and is described in this patent by Seq ID 86.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. HIF1alpha and EPAS1 are transcription factors that mediate the response to hypoxia of several
genes, and have themselves been implicated in specific diseases. By adenoviral over-expression of
EPAS1 we show augmentation of the hypoxic induction of certain genes, further confirming their status
as responsive to hypoxia. EGL nine (C. elegans) homolog 3 has been shown to be induced by hypoxia to a
greater degree following adenoviral over-expression of EPAS1. Its preferential regulation by EPAS1
provides a route to preferential intervention, to avoid toxicity to other tissues. Hepatocytes are the main
cell type of the liver and genes that are induced in response to hypoxia in this cell type are relevant to
development of diagnostics and therapeutics towards liver diseases involving hypoxia, including
cirrhosis. EGL nine (C. elegans) homolog 3 is preferentially induced by hypoxia in hepatocytes. We find
that EGLN3 and a related human gene, C1orf12 (Seq ID 89/90), both of which are predicted to be proline
hydroxylases, are expressed at differing absolute expression levels in different tissues. For instance, in the
hypoxic hepatocyte (6hr) the normalised expression values of EGLN3 and C1orf12 are 0.015 and 0.0074
respectively, i.e. EGLN3 being the dominant gene. In contrast, in the neuroblastoma cell line SH-SY5Y,
the normalised expression values of EGLN3 and C1orf12 after 6hr hypoxia are 0.0012 and 0.108
respectively, i.e. C1orf12 being the dominant gene by a large margin. These data demonstrate that
C1orf12 and EGLN3 are not constitutively expressed at an equal amount in different tissues, indicating
specificity of function. Therefore therapeutic products may be developed based on these data, with the
goal of modulating proline hydroxylation of target proteins (such as HIF1alpha) in specific tissues, based
on the differing expression profile of C1orf12 and EGLN3 in those tissues.
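
The comparison of the two predicted proline hydroxylases can be made explicit by computing, for each
cell type, which gene is dominant and by what ratio. The sketch below uses only the four normalised
expression values quoted above; the dictionary layout and the function name are illustrative assumptions
and are not part of the described method.

```python
# Normalised expression values quoted in the text (6hr hypoxia).
expression = {
    "hypoxic hepatocyte": {"EGLN3": 0.015, "C1orf12": 0.0074},
    "SH-SY5Y neuroblastoma": {"EGLN3": 0.0012, "C1orf12": 0.108},
}


def dominant_gene(levels):
    """Return (gene, ratio): the higher-expressed gene and its ratio to the other."""
    ranked = sorted(levels.items(), key=lambda item: item[1], reverse=True)
    (top_gene, top_value), (_, other_value) = ranked[0], ranked[1]
    return top_gene, top_value / other_value


results = {cell: dominant_gene(levels) for cell, levels in expression.items()}

for cell, (gene, ratio) in results.items():
    print(f"{cell}: {gene} dominant, about {ratio:.1f}-fold")
```

On these figures EGLN3 exceeds C1orf12 roughly 2-fold in the hypoxic hepatocyte, whereas C1orf12
exceeds EGLN3 roughly 90-fold in SH-SY5Y cells, which is the "large margin" referred to above.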

The Oxford BioMedica clone p1D14 represents C1orf12. The protein sequence encoded by C1orf12 is
represented in the public databases by the accession NP_071334 and is described in this patent by Seq ID
89. The nucleotide sequence is represented in the public sequence databases by the accession
NM_022051 and is described in this patent by Seq ID 90. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. We find that C1orf12 and a related
human gene, EGLN3 (Seq ID 85/86), both of which are predicted to be proline hydroxylases, are expressed
at differing absolute expression levels in different tissues. For instance, in the hypoxic hepatocyte (6hr)
the normalised expression values of EGLN3 and C1orf12 are 0.015 and 0.0074 respectively, i.e. EGLN3
being the dominant gene. In contrast, in the neuroblastoma cell line SH-SY5Y, the normalised expression
values of EGLN3 and C1orf12 after 6hr hypoxia are 0.0012 and 0.108 respectively, i.e. C1orf12 being the
dominant gene by a large margin. These data demonstrate that C1orf12 and EGLN3 are not
constitutively expressed at an equal amount in different tissues, indicating specificity of function.
Therefore therapeutic products may be developed based on these data, with the goal of modulating proline
hydroxylation of target proteins (such as HIF1alpha) in specific tissues, based on the differing expression
profile of C1orf12 and EGLN3 in those tissues.

The Oxford BioMedica clone p2B1 represents PRAME. The protein sequence encoded by PRAME is
represented in the public databases by the accession NP_006106 and is described in this patent by Seq ID
87. The nucleotide sequence is represented in the public sequence databases by the accession
NM_006115 and is described in this patent by Seq ID 88. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. Hypoxia is frequently found in
human tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian
or breast cancer, PRAME is up-regulated in the malignant tissue as compared to adjacent normal tissue in
at least one patient. PRAME is a well-known tumour-associated antigen. Our surprising demonstration of
its hypoxia-regulation provides for an important diagnostic test to distinguish false-positive results. In
addition, we show the relevance of PRAME to hypoxia-related functions of tumours such as
angiogenesis.

The Oxford BioMedica clones p1D17 and p1P14 represent Semaphorin 4b. The protein sequence
encoded by Semaphorin 4b is represented in the public databases by the accession BAB21836 and is
described in this patent by Seq ID 91. The nucleotide sequence is represented in the public sequence
databases by the accession AB051532 and is described in this patent by Seq ID 92. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. The
response of renal epithelial cells to hypoxia is pertinent to kidney failure, especially regarding the
medullary tissue. Semaphorin 4b is preferentially induced by hypoxia in renal epithelial cells.
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Semaphorin 4b
is induced in macrophages activated by LPS and gamma interferon. Semaphorin 4b is also induced by
the presence of reactive oxygen species. We expect it to have a pro-inflammatory role, and its
inhibition may have an anti-inflammatory effect. We have cited elsewhere in this specification that a
plexin is hypoxia-regulated, and we propose a functional relationship between these two molecules.
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of
5 patients with either ovarian or breast cancer, Semaphorin 4b is up-regulated in the malignant tissue as
compared to adjacent normal tissue in at least one patient. Semaphorin 4b is also induced in response to
superoxide radicals, as found in various disease states, implying utility. Semaphorin 4b is predicted to
function in modulating several cellular processes key to human disease, including angiogenesis,
inflammation, immune cell migration and tissue remodelling. Other Semaphorins, including Semaphorin
E, which are induced in response to hypoxia, will also be implicated in these disease processes and have
utility as described for Semaphorin 4b.

The Oxford BioMedica clone p1C24 represents SLC25A19. The protein sequence encoded by SLC25A19
is represented in the public databases by the accession NP_068380 and is described in this patent by Seq
ID 93. The nucleotide sequence is represented in the public sequence databases by the accession
NM_021734 and is described in this patent by Seq ID 94. SLC25A19 transports deoxynucleotides into
mitochondria and is therefore essential for mtDNA synthesis. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clone p1D3 represents Serine carboxypeptidase 1. The protein sequence encoded
by Serine carboxypeptidase 1 is represented in the public databases by the accession NP_067639 and is
described in this patent by Seq ID 95. The nucleotide sequence is represented in the public sequence
databases by the accession NM_021626 and is described in this patent by Seq ID 96. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products.
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Serine
carboxypeptidase 1 is repressed in macrophages activated by LPS and gamma interferon. TNFalpha is an
inflammatory cytokine, which acts on macrophages, and has been shown to be central to the
pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change in
expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and
diagnostic products for such inflammatory conditions. Serine carboxypeptidase 1 is induced in
macrophages activated by TNFalpha. Increased serine carboxypeptidase activity in glial cells has been
shown to result in neurological abnormalities, due to the degradation of essential neuro-active factors.
Similarly, peripheral neurological disease could result from such activity in macrophages. Our
demonstration of hypoxia regulation of serine carboxypeptidase activity opens a route for diagnosis and
treatment of these diseases. Hypoxia is frequently found in human tumours where macrophage infiltrates
are also found. In a series of 5 patients with either ovarian or breast cancer, Serine carboxypeptidase 1 is
down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1E14 represents an unknown mRNA (schizophrenia-linked). The protein
sequence encoded by the unknown mRNA (schizophrenia-linked) is not represented in the public
databases by a protein accession. The nucleotide sequence is represented in the public sequence databases
by the accession AY010112 and is described in this patent by Seq ID 98. Hypoxia is an important feature
of several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis
and have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. TNFalpha is an inflammatory
cytokine, which acts on macrophages, and has been shown to be central to the pathophysiology and
treatment of diseases including rheumatoid arthritis. Genes that change in expression in response to
TNFalpha therefore have utility in the design of therapeutic, prognostic and diagnostic products for such
inflammatory conditions. The unknown mRNA (schizophrenia-linked) is induced in macrophages activated
by TNFalpha. There are many enzymic activities that can give rise to neurological abnormalities, and
their hypoxia regulation is pertinent to the diagnosis and treatment of such diseases, including
schizophrenia.

The Oxford BioMedica clone p1E20 represents Myo-inositol monophosphatase A3. The protein sequence
encoded by Myo-inositol monophosphatase A3 is represented in the public databases by the accession
AAK52336 and is described in this patent by Seq ID 99. The nucleotide sequence is represented in the
public sequence databases by the accession NM_017813 and is described in this patent by Seq ID 100.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. As referred to elsewhere in this specification, we have found several components of the
phosphatidylinositol second messenger system to be hypoxia-regulated. This system has profound effects
which are relevant to many diseases with known associations with hypoxia and ischaemia. Local and
transient ischaemia is relevant to such diseases as rheumatoid arthritis and atherosclerosis, and also
potentially to such diseases as schizophrenia and bipolar disorder. It is instructive that lithium, which is a
well-recognised treatment for affective disorders, appears to operate via the phosphatidylinositol system
[Pettegrew et al 2001, Bipolar Disord 3:189-201]. Macrophages are key to several diseases involving
hypoxia, and contribute to inflammatory processes. In these, macrophages are frequently activated by
cytokines, which have been shown to be present at disease sites, so gene expression responses to both
hypoxia and cytokines are especially relevant. Myo-inositol monophosphatase A3 is repressed in
macrophages activated by LPS and gamma interferon.

The Oxford BioMedica clone p2A24 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession AA521314 and is described in this patent
by Seq ID 102. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products.

The Oxford BioMedica clone p1E17 represents Hypothetical protein FLJ31668. The protein sequence
encoded by Hypothetical protein FLJ31668 is represented in the public databases by the accession
BAB71124 and is described in this patent by Seq ID 103. The nucleotide sequence is represented in the
public sequence databases by the accession AK056230 and is described in this patent by Seq ID 104.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products.

The Oxford BioMedica clone p1E19 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession R51835 and is described in this patent by
Seq ID 106. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and
diagnostic products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also
found. In a series of 5 patients with either ovarian or breast cancer, the EST represented by Seq ID 106 is
up-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1E15 represents cDNA YI27F12. The protein sequence encoded by cDNA
YI27F12 is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession AF075018 and is described in this patent
by Seq ID 108. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products. The cDNA YI27F12 is induced in macrophages treated with the
inhibitory cytokine IL-10. The cDNA YI27F12 is repressed in macrophages activated by IL-17. We
expect the product of cDNA YI27F12 to have an anti-inflammatory role.

The Oxford BioMedica clone p1E11 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession R69248 and is described in this patent by
Seq ID 110. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and
diagnostic products.

The Oxford BioMedica clone p1E23 represents cDNA FLJ14041 fis, clone HEMBA1005780. The
protein sequence encoded by cDNA FLJ14041 fis, clone HEMBA1005780 is not represented in the
public databases by a protein accession. The nucleotide sequence is represented in the public sequence
databases by the accession AK024103 and is described in this patent by Seq ID 112. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clone p1E21 represents Glutamate-cysteine ligase, modifier subunit. The protein
sequence encoded by Glutamate-cysteine ligase, modifier subunit is represented in the public databases
by the accession NP_002052 and is described in this patent by Seq ID 113. The nucleotide sequence is
represented in the public sequence databases by the accession NM_002061 and is described in this patent
by Seq ID 114. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products. Glutamate-cysteine ligase is the rate-limiting enzyme of glutathione
synthesis, and this enzyme is relevant to cell survival under stress. Hypoxia is frequently found in human
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast
cancer, Glutamate-cysteine ligase, modifier subunit is up-regulated in the malignant tissue as compared to
adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1D23 represents PTEN. The protein sequence encoded by PTEN is
represented in the public databases by the accession NP_000305 and is described in this patent by Seq ID
115. The nucleotide sequence is represented in the public sequence databases by the accession
NM_000314 and is described in this patent by Seq ID 116. PTEN is a member of the mixed function,
serine/threonine/tyrosine phosphatase subfamily of protein phosphatases. Its physiological substrates,
however, are primarily 3-phosphorylated inositol phospholipids, which are products of phosphoinositide
3-kinases [Downes et al 2001, Biochem Soc Trans 29:846-51]. Hypoxia-regulation of this gene is a
further element in the hypoxic regulation of this important second messenger system. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clone p1D24 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession T73780 and is described in this patent by
Seq ID 118. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and
diagnostic products. The response of renal epithelial cells to hypoxia is pertinent to kidney failure,
especially regarding the medullary tissue. The EST represented by Seq ID 118 is preferentially induced
by hypoxia in renal epithelial cells. Macrophages are key to several diseases involving hypoxia, and
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which
have been shown to be present at disease sites, so gene expression responses to both hypoxia and
cytokines are especially relevant. The EST represented by Seq ID 118 is induced in macrophages
activated by LPS and gamma interferon.

The Oxford BioMedica clones p1D22 and p1G5 represent MAX-interacting protein 1. The protein
sequence encoded by MAX-interacting protein 1 is represented in the public databases by the accession
NP_005953 and is described in this patent by Seq ID 119 and 279. The nucleotide sequence is
represented in the public sequence databases by the accession NM_005962 and is described in this patent
by Seq ID 120 and 280. MAX-interacting protein 1 is a negative regulator of myc oncoprotein with tumor
suppressor properties. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which
have been shown to be present at disease sites, so gene expression responses to both hypoxia and
cytokines are especially relevant. MAX-interacting protein 1 is repressed in macrophages activated by
LPS and gamma interferon.

The Oxford BioMedica clone p1E2 represents Mannosidase, alpha, class 1A, member 1. The protein
sequence encoded by Mannosidase, alpha, class 1A, member 1 is represented in the public databases by
the accession NP_005898 and is described in this patent by Seq ID 121. The nucleotide sequence is
represented in the public sequence databases by the accession NM_005907 and is described in this patent
by Seq ID 122. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products. Hypoxia is frequently found in human tumours where macrophage
infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, Mannosidase,
alpha, class 1A, member 1 is down-regulated in the malignant tissue as compared to adjacent normal
tissue in at least one patient.




The Oxford BioMedica clone p1E1 represents an unannotated EST. The protein sequence encoded by this
EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession AA446361 and is described in this patent
by Seq ID 124. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which
have been shown to be present at disease sites, so gene expression responses to both hypoxia and
cytokines are especially relevant. The EST represented by Seq ID 124 is repressed in macrophages
activated by LPS and gamma interferon.

The Oxford BioMedica clone p1E4 represents an unannotated EST. The protein sequence encoded by this
EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession AA931411 and is described in this patent
by Seq ID 126. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which
have been shown to be present at disease sites, so gene expression responses to both hypoxia and
cytokines are especially relevant. The EST represented by Seq ID 126 is repressed in macrophages
activated by LPS and gamma interferon. We expect this gene product to have an anti-inflammatory role.
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of
5 patients with either ovarian or breast cancer, the EST represented by Seq ID 126 is down-regulated in
the malignant tissue as compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1D18 represents cDNA FLJ13443 fis, clone PLACE1002853. The protein
sequence encoded by cDNA FLJ13443 fis, clone PLACE1002853 is not represented in the public
databases by a protein accession. The nucleotide sequence is represented in the public sequence databases
by the accession AK023505 and is described in this patent by Seq ID 128. Hypoxia is an important
feature of several diseases, and genes that respond to this stimulus are therefore implicated in the
pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products.
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. The cDNA
FLJ13443 fis, clone PLACE1002853 is repressed in macrophages activated by LPS and gamma
interferon. We expect it to have an anti-inflammatory role.

The Oxford BioMedica clone p1D21 represents Hypothetical protein FLJ22622. The protein sequence
encoded by Hypothetical protein FLJ22622 is represented in the public databases by the accession
BAB15424 and is described in this patent by Seq ID 129. The nucleotide sequence is represented in the
public sequence databases by the accession NM_025151 and is described in this patent by Seq ID 130.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. The response of renal epithelial cells to hypoxia is pertinent to kidney failure, especially
regarding the medullary tissue. Hypothetical protein FLJ22622 is preferentially induced by hypoxia in
renal epithelial cells. Macrophages are key to several diseases involving hypoxia, and contribute to
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been
shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are
especially relevant. Hypothetical protein FLJ22622 is repressed in macrophages activated by LPS and
gamma interferon. We expect it to have an anti-inflammatory role. Hypoxia is frequently found in human
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast
cancer, Hypothetical protein FLJ22622 is up-regulated in the malignant tissue as compared to adjacent
normal tissue in at least one patient.

The Oxford BioMedica clone p1C22 represents CD84-H1. The protein sequence encoded by CD84-H1 is
represented in the public databases by the accession AAK69052 and is described in this patent by Seq ID
131. The nucleotide sequence is represented in the public sequence databases by the accession AF275725
and is described in this patent by Seq ID 132. Hypoxia is an important feature of several diseases, and
genes that respond to this stimulus are therefore implicated in the pathogenesis and have utility in the
design of therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clone p1C23 represents Hypothetical protein FLJ12832. The protein sequence
encoded by Hypothetical protein FLJ12832 is represented in the public databases by the accession
XP_043394 and is described in this patent by Seq ID 133. The nucleotide sequence is represented in the
public sequence databases by the accession AK022894 and is described in this patent by Seq ID 134.
Hypothetical protein FLJ12832 is a putative ubiquitin as it shows high structural similarity to ubiquitin C
and contains a ubiquitin domain. Hypoxia is an important feature of several diseases, and genes that
respond to this stimulus are therefore implicated in the pathogenesis and have utility in the design of
therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clone p1D11 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession AA251748 and is described in this patent
by Seq ID 136. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products.

The Oxford BioMedica clones p1E3 and p1F16 represent CYP1B1. The protein sequence encoded by
CYP1B1 is represented in the public databases by the accession NP_000095 and is described in this
patent by Seq ID 137 and 325. The nucleotide sequence is represented in the public sequence databases
by the accession NM_000104 and is described in this patent by Seq ID 138 and 326. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. There is
a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary to this, we show
that genes are regulated by hypoxia to a greater degree in certain cell types, substantiating their utility in
designing specific therapeutic products for diseases involving those cell types. Monocytes and
macrophages have been implicated in the following diseases involving hypoxia: rheumatoid arthritis,
atherosclerosis, cancer, COPD. CYP1B1 is preferentially induced by hypoxia in monocytes or
macrophages and a restricted number of other cell types. Macrophages are key to several diseases
involving hypoxia, and contribute to inflammatory processes. In these, macrophages are frequently
activated by cytokines, which have been shown to be present at disease sites, so gene expression
responses to both hypoxia and cytokines are especially relevant. CYP1B1 is repressed in macrophages
activated by LPS and gamma interferon. TNFalpha is an inflammatory cytokine, which acts on
macrophages, and has been shown to be central to the pathophysiology and treatment of diseases
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions.
CYP1B1 is induced in macrophages activated by TNFalpha. Hypoxia is frequently found in human
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast
cancer, CYP1B1 is up-regulated and also down-regulated in the malignant tissue as compared to adjacent
normal tissue in at least one patient.

The Oxford BioMedica clone p1D20 represents Hypothetical protein KIAA1125. The protein sequence
encoded by Hypothetical protein KIAA1125 is represented in the public databases by the accession
XP_012932 and is described in this patent by Seq ID 139. The nucleotide sequence is represented in the
public sequence databases by the accession AB032951 and is described in this patent by Seq ID 140.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products.

The Oxford BioMedica clone p1E5 represents Hepcidin antimicrobial peptide. The protein sequence
encoded by Hepcidin antimicrobial peptide is represented in the public databases by the accession
NP_066998 and is described in this patent by Seq ID 141. The nucleotide sequence is represented in the
public sequence databases by the accession NM_021175 and is described in this patent by Seq ID 142.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. Hepatocytes are the main cell type of the liver and genes that are induced in response to hypoxia
in this cell type are relevant to development of diagnostics and therapeutics towards liver diseases
involving hypoxia, including cirrhosis. Hepcidin antimicrobial peptide is preferentially induced by
hypoxia in hepatocytes. Hepcidin antimicrobial peptide is induced in macrophages treated with the
inhibitory cytokine IL-10. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has
been shown to be central to the pathophysiology and treatment of diseases including rheumatoid arthritis.
Genes that change in expression in response to TNFalpha therefore have utility in the design of
therapeutic, prognostic and diagnostic products for such inflammatory conditions. Hepcidin antimicrobial
peptide is repressed in macrophages activated by TNFalpha.

The Oxford BioMedica clone p1D19 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession R68736 and is described in this patent by
Seq ID 144. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and
diagnostic products. Macrophages are key to several diseases involving hypoxia, and contribute to
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been
shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are
especially relevant. The EST represented by Seq ID 144 is induced in macrophages activated by LPS and
gamma interferon and is also induced in macrophages activated by IL-15. We expect the gene product
relevant to the EST represented by Seq ID 144 to have a pro-inflammatory role, and its inhibition may
have an anti-inflammatory effect.

The Oxford BioMedica clone p2A15 represents Sialyltransferase. The protein sequence encoded by
Sialyltransferase is represented in the public databases by the accession NP_006447 and is described in
this patent by Seq ID 145. The nucleotide sequence is represented in the public sequence databases by the
accession NM_006456 and is described in this patent by Seq ID 146. Hypoxia is an important feature of
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and
have utility in the design of therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clone p1I14 represents cDNA DKFZp564D016. The protein sequence encoded by
cDNA DKFZp564D016 is not represented in the public databases by a protein accession. The nucleotide
sequence is represented in the public sequence databases by the accession AL050021 and is described in
this patent by Seq ID 148. Hypoxia is an important feature of several diseases, and genes that respond to
this stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products.

The Oxford BioMedica clone p1I2 represents cDNA FLJ11302 fis, clone PLACE1009971. The protein
sequence encoded by cDNA FLJ11302 fis, clone PLACE1009971 is not represented in the public
databases by a protein accession. The nucleotide sequence is represented in the public sequence databases
by the accession AK002164 and is described in this patent by Seq ID 150. Hypoxia is an important
feature of several diseases, and genes that respond to this stimulus are therefore implicated in the
pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products.
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. The cDNA
FLJ11302 fis, clone PLACE1009971 is repressed in macrophages activated by LPS and gamma
interferon. We expect it to have an anti-inflammatory role.

The Oxford BioMedica clone p1I12 represents Hypothetical protein MGC4549. The protein sequence
encoded by Hypothetical protein MGC4549 is represented in the public databases by the accession
XP_032794 and is described in this patent by Seq ID 151. The nucleotide sequence is represented in the
public sequence databases by the accession NM_032377 and is described in this patent by Seq ID 152.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. Hypothetical protein MGC4549 is induced in macrophages treated with the inhibitory cytokine
IL-10. Hypothetical protein MGC4549 is repressed in macrophages activated by IL-17 and is also
repressed in macrophages activated by IL-15. We expect it to have an anti-inflammatory role.

The Oxford BioMedica clone p1I3 represents ELMO2. The protein sequence encoded by ELMO2 is
represented in the public databases by the accession AAL14467 and is described in this patent by Seq ID
153. The nucleotide sequence is represented in the public sequence databases by the accession
XM_012933 and is described in this patent by Seq ID 154. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. This gene has been shown recently
to promote phagocytosis and cell shape changes [Gumienny et al 2001, Cell 107:27-41]. These functions
are typical of the macrophage, and are likely to play a role in macrophage-associated diseases.

The Oxford BioMedica clone p1I10 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession AA420992 and is described in this patent
by Seq ID 156. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products.

The Oxford BioMedica clone p1H18 represents Ubiquitin specific protease 7. The protein sequence
encoded by Ubiquitin specific protease 7 is represented in the public databases by the accession
NP_003461 and is described in this patent by Seq ID 157. The nucleotide sequence is represented in the
public sequence databases by the accession NM_003470 and is described in this patent by Seq ID 158.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In
a series of 5 patients with either ovarian or breast cancer, Ubiquitin specific protease 7 is up-regulated in
the malignant tissue as compared to adjacent normal tissue in at least one patient. We expect decreased
activity of the gene product to have an anti-tumour effect.

The Oxford BioMedica clone p1H24 represents Nucleolar phosphoprotein Nopp34. The protein sequence
encoded by Nucleolar phosphoprotein Nopp34 is represented in the public databases by the accession
NP_115766 and is described in this patent by Seq ID 159. The nucleotide sequence is represented in the
public sequence databases by the accession NM_032390 and is described in this patent by Seq ID 160.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products.

The Oxford BioMedica clone p1E22 represents cDNA FLJ13618 fis, clone PLACE1010925. The protein
sequence encoded by cDNA FLJ13618 fis, clone PLACE1010925 is not represented in the public
databases by a protein accession. The nucleotide sequence is represented in the public sequence databases
by the accession AK023680 and is described in this patent by Seq ID 162. Hypoxia is an important
feature of several diseases, and genes that respond to this stimulus are therefore implicated in the
pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products.
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. The cDNA
FLJ13618 fis, clone PLACE1010925 is induced in macrophages activated by LPS and gamma interferon.

The Oxford BioMedica clone p1H21 represents Hypothetical protein FLJ13511. The protein sequence
encoded by Hypothetical protein FLJ13511 is represented in the public databases by the accession
NP_149014 and is described in this patent by Seq ID 163. The nucleotide sequence is represented in the
public sequence databases by the accession NM_033025 and is described in this patent by Seq ID 164.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. There is a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary
to this, we show that genes are regulated by hypoxia to a greater degree in certain cell types,
substantiating their utility in designing specific therapeutic products for diseases involving those cell
types. Monocytes and macrophages have been implicated in the following diseases involving hypoxia:
rheumatoid arthritis, atherosclerosis, cancer, COPD. Hypothetical protein FLJ13511 is preferentially
induced by hypoxia in monocytes or macrophages and a restricted number of other cell types. Hypoxia is
frequently found in human tumours where macrophage infiltrates are also found. In a series of 5 patients
with either ovarian or breast cancer, Hypothetical protein FLJ13511 is down-regulated in the malignant
tissue as compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1I1 represents Ribosomal RNA intergenic spacer. The protein sequence
encoded by Ribosomal RNA intergenic spacer is not represented in the public databases by a protein
accession. The nucleotide sequence is represented in the public sequence databases by the accession
AA664228 and is described in this patent by Seq ID 166. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clone p1H14 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession R44397 and is described in this patent by
Seq ID 168. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and
diagnostic products.

The Oxford BioMedica clone p1H11 represents Carboxypeptidase M. The protein sequence encoded by
Carboxypeptidase M is represented in the public databases by the accession NP_001865 and is described
in this patent by Seq ID 169. The nucleotide sequence is represented in the public sequence databases by
the accession NM_001874 and is described in this patent by Seq ID 170. Hypoxia is an important feature
of several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis
and have utility in the design of therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clone p1H17 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession W87747 and is described in this patent by
Seq ID 172. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and
diagnostic products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also
found. In a series of 5 patients with either ovarian or breast cancer, the EST represented by Seq ID 172 is
up-regulated and also down-regulated in the malignant tissue as compared to adjacent normal tissue in at
least one patient.

The Oxford BioMedica clone p1H12 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession AA973568 and is described in this patent
by Seq ID 174. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products.

The Oxford BioMedica clone p1H7 represents an unannotated EST. The protein sequence encoded by this
EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession T98529 and is described in this patent by
Seq ID 176. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and
diagnostic products.

The Oxford BioMedica clone p1H15 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession AA022679 and is described in this patent
by Seq ID 178. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products. Hypoxia is frequently found in human tumours where macrophage
infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, the EST represented
by Seq ID 178 is up-regulated and also down-regulated in the malignant tissue as compared to adjacent
normal tissue in at least one patient.

The Oxford BioMedica clone p1H20 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession H17921 and is described in this patent by
Seq ID 180. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and
diagnostic products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also
found. In a series of 5 patients with either ovarian or breast cancer, the EST represented by Seq ID 180 is
up-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient. We
expect decreased activity of the gene product to have an anti-tumour effect.

The Oxford BioMedica clone p1H8 represents ABL. The protein sequence encoded by ABL is
represented in the public databases by the accession NP_009297 and is described in this patent by Seq ID
181. The nucleotide sequence is represented in the public sequence databases by the accession
NM_007313 and is described in this patent by Seq ID 182. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. ABL is induced in macrophages
treated with the inhibitory cytokine IL-10. ABL is repressed in macrophages activated by IL-17 and is
also repressed in macrophages activated by IL-15. We expect it to have an anti-inflammatory role.
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of
5 patients with either ovarian or breast cancer, ABL is up-regulated in the malignant tissue as compared
to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1H16 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession W91958 and is described in this patent by
Seq ID 184. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and
diagnostic products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also
found. In a series of 5 patients with either ovarian or breast cancer, the EST represented by Seq ID 184 is
up-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1H9 represents an unannotated EST. The protein sequence encoded by this
EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession R63694 and is described in this patent by
Seq ID 186. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and
diagnostic products.

The Oxford BioMedica clone p1H23 represents Hypothetical protein FLJ21094. The protein sequence
encoded by Hypothetical protein FLJ21094 is represented in the public databases by the accession
AAH14003 and is described in this patent by Seq ID 187. The nucleotide sequence is represented in the
public sequence databases by the accession AK024747 and is described in this patent by Seq ID 188.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products.

The Oxford BioMedica clone p1H10 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession AA909912 and is described in this patent 
by Seq ID 190. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. 

The Oxford BioMedica clone p1H6 represents an unannotated EST. The protein sequence encoded by this
EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession T99032 and is described in this patent by 
Seq ID 192. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and
diagnostic products. The EST represented by Seq ID 192 is induced in macrophages treated with the
inhibitory cytokine IL-10. The EST represented by Seq ID 192 is repressed in macrophages activated by 
IL-15. We expect it to have an anti-inflammatory role. 

The Oxford BioMedica clone p1H13 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession H52503 and is described in this patent by
Seq ID 194. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. Macrophages are key to several diseases involving hypoxia, and contribute to 
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been
shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are
especially relevant. The EST represented by Seq ID 194 is repressed in macrophages activated by LPS 
and gamma interferon. 

The Oxford BioMedica clone p1H19 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession AA127017 and is described in this patent
by Seq ID 196. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Hypoxia is frequently found in human tumours where macrophage
infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, the EST represented
by Seq ID 196 is down-regulated in the malignant tissue as compared to adjacent normal tissue in at
least one patient.

The Oxford BioMedica clone p1G22 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession R38647 and is described in this patent by 
Seq ID 198. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. Endothelial cells are key to angiogenesis, a process implicated in several diseases 
associated with hypoxia, including cancer and rheumatoid arthritis. The EST represented by Seq ID 198 is
preferentially induced by hypoxia in endothelial cells. We expect this gene product to have a
pro-angiogenic effect, and its inhibition to have an anti-angiogenic effect.

The Oxford BioMedica clone p1G21 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession T87233 and is described in this patent by
Seq ID 200. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. 

The Oxford BioMedica clone p1H1 represents Hypothetical protein FLJ10826. The protein sequence
encoded by Hypothetical protein FLJ10826 is represented in the public databases by the accession
BAB14226 and is described in this patent by Seq ID 201. The nucleotide sequence is represented in the
public sequence databases by the accession NM_018233 and is described in this patent by Seq ID 202.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products.

The Oxford BioMedica clone p1G20 represents cDNA YO23H03. The protein sequence encoded by
cDNA YO23H03 is not represented in the public databases by a protein accession. The nucleotide 
sequence is represented in the public sequence databases by the accession AF075053 and is described in 
this patent by Seq ID 204. Hypoxia is an important feature of several diseases, and genes that respond to 
this stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and
cytokines are especially relevant. The cDNA YO23H03 is repressed in macrophages activated by LPS
and gamma interferon. 

The Oxford BioMedica clone p1H5 represents Hypothetical protein FLJ22690. The protein sequence
encoded by Hypothetical protein FLJ22690 is represented in the public databases by the accession
NP_078987 and is described in this patent by Seq ID 205. The nucleotide sequence is represented in the
public sequence databases by the accession NM_024711 and is described in this patent by Seq ID 206.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Endothelial cells are key to angiogenesis, a process implicated in several diseases associated
with hypoxia, including cancer and rheumatoid arthritis. Hypothetical protein FLJ22690 is preferentially
induced by hypoxia in endothelial cells. We expect this gene product to have a pro-angiogenic effect, and 
its inhibition to have an anti-angiogenic effect. Macrophages are key to several diseases involving 
hypoxia, and contribute to inflammatory processes. In these, macrophages are frequently activated by 
cytokines, which have been shown to be present at disease sites, so gene expression responses to both
hypoxia and cytokines are especially relevant. Hypothetical protein FLJ22690 is induced in macrophages
activated by IL-15. 

The Oxford BioMedica clone p1G19 represents Mitochondrion sequence. The protein sequence encoded
by Mitochondrion sequence is represented in the public databases by the accession AAH05845 and is 
described in this patent by Seq ID 207. The nucleotide sequence is represented in the public sequence
databases by the accession BC005845 and is described in this patent by Seq ID 208. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of 
5 patients with either ovarian or breast cancer, the Mitochondrion sequence represented by Seq ID 208 is
down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1H2 represents Fatty acid binding protein 5. The protein sequence
encoded by Fatty acid binding protein 5 is represented in the public databases by the accession 
NP_001435 and is described in this patent by Seq ID 209. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_001444 and is described in this patent by Seq ID 210. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. There is a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary 
to this, we show that genes are regulated by hypoxia to a greater degree in certain cell types, 
substantiating their utility in designing specific therapeutic products for diseases involving those cell
types. Monocytes and macrophages have been implicated in the following diseases involving hypoxia:
rheumatoid arthritis, atherosclerosis, cancer, COPD. Fatty acid binding protein 5 is preferentially induced 
by hypoxia in monocytes or macrophages. Crucially and very recently, Fatty acid binding protein 5 
expressed in macrophages has been shown to play a very important role in the development of 
atherosclerotic plaques [Layne et al. 2001, FASEB J 15:2733-5]. Our demonstration of hypoxic regulation
of this gene not only makes clear how this gene can participate in disease initiation and progression, but
also provides a potential route to diagnosis and therapy of atherosclerosis. Macrophages are key to several
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. TNFalpha is an inflammatory
cytokine, which acts on macrophages, and has been shown to be central to the pathophysiology and 
treatment of diseases including rheumatoid arthritis. Genes that change in expression in response to 
TNFalpha therefore have utility in the design of therapeutic, prognostic and diagnostic products for such 
inflammatory conditions. Fatty acid binding protein 5 is repressed in macrophages activated by
TNFalpha.

The Oxford BioMedica clone p1G18 represents Mitochondrion sequence. The protein sequence encoded
by Mitochondrion sequence is not represented in the public databases by a protein accession. The 
nucleotide sequence is represented in the public sequence databases by the accession BC001612 and is 
described in this patent by Seq ID 212. Hypoxia is an important feature of several diseases, and genes that
respond to this stimulus are therefore implicated in the pathogenesis and have utility in the design of
therapeutic, prognostic and diagnostic products. Macrophages are key to several diseases involving 
hypoxia, and contribute to inflammatory processes. In these, macrophages are frequently activated by
cytokines, which have been shown to be present at disease sites, so gene expression responses to both 
hypoxia and cytokines are especially relevant. The Mitochondrion sequence represented by Seq ID 212 is
repressed in macrophages activated by LPS and gamma interferon.

The Oxford BioMedica clone p1H4 represents an unannotated EST. The protein sequence encoded by this
EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession AA679939 and is described in this patent 
by Seq ID 214. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. The EST represented by Seq ID 214 is repressed in macrophages
activated by IL-17. We expect it to have an anti-inflammatory role. Hypoxia is frequently found in human
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast
cancer, the EST represented by Seq ID 214 is down-regulated in the malignant tissue as compared to
adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p1H3 represents an unannotated EST. The protein sequence encoded by this
EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession AA630167 and is described in this patent 
by Seq ID 216. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Hypoxia is frequently found in human tumours where macrophage
infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, the EST represented
by Seq ID 216 is up-regulated in the malignant tissue as compared to adjacent normal tissue in at least
one patient, and down-regulated in at least one patient.

The protein sequence encoded by BCL2/adenovirus E1B 19kD-interacting protein 3-like is represented in 
the public databases by the accession NP_004322 and is described in this patent by Seq ID 217. The
nucleotide sequence is represented in the public sequence databases by the accession NM_004331 and is 
described in this patent by Seq ID 218. Hypoxia is an important feature of several diseases, and genes that 
respond to this stimulus are therefore implicated in the pathogenesis and have utility in the design of 
therapeutic, prognostic and diagnostic products. 

The protein sequence encoded by SLC2A1 is represented in the public databases by the accession
NP_006507 and is described in this patent by Seq ID 219. The nucleotide sequence is represented in the
public sequence databases by the accession NM_006516 and is described in this patent by Seq ID 220.
SLC2A1 is a glucose transporter gene and is also known as GLUT1. Hypoxia is an important feature of
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and
have utility in the design of therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clone p1P3 represents PDGFB. The protein sequence encoded by PDGFB is
represented in the public databases by the accession NP_148937 and is described in this patent by Seq ID 
221. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_033016 and is described in this patent by Seq ID 222. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. PDGFB is induced in
macrophages activated by LPS and gamma interferon.

The Oxford BioMedica clones p1A8 and p1A9 represent Lactate dehydrogenase A. The protein sequence
encoded by Lactate dehydrogenase A is represented in the public databases by the accession NP_005557
and is described in this patent by Seq ID 223. The nucleotide sequence is represented in the public
sequence databases by the accession NM_005566 and is described in this patent by Seq ID 224. Hypoxia
is an important feature of several diseases, and genes that respond to this stimulus are therefore implicated
in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products.
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Lactate
dehydrogenase A is repressed in macrophages activated by LPS and gamma interferon and is also
repressed in macrophages activated by IL-15. TNFalpha is an inflammatory cytokine, which acts on
macrophages, and has been shown to be central to the pathophysiology and treatment of diseases
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions.
Lactate dehydrogenase A is induced in macrophages activated by TNFalpha. Hypoxia is frequently found
in human tumours where macrophage infiltrates are also found. In a series of 5 patients with either
ovarian or breast cancer, Lactate dehydrogenase A is up-regulated in the malignant tissue as compared to
adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1B17 represents Tissue factor. The protein sequence encoded by Tissue
factor is represented in the public databases by the accession NP_001984 and is described in this patent 
by Seq ID 225. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_001993 and is described in this patent by Seq ID 226. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. Tissue factor is preferentially 
induced by hypoxia in mammary epithelial cells. Macrophages are key to several diseases involving 
hypoxia, and contribute to inflammatory processes. In these, macrophages are frequently activated by
cytokines, which have been shown to be present at disease sites, so gene expression responses to both
hypoxia and cytokines are especially relevant. TNFalpha is an inflammatory cytokine, which acts on
macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions. 
Tissue factor is induced in macrophages activated by TNFalpha. Tissue factor is the primary initiator of
blood coagulation with structural homology to the cytokine receptor family, and has been implicated in
various vascular processes including metastasis, angiogenesis, and atherosclerosis. Our demonstration of
hypoxic regulation leads to a clear understanding of the possibility of intervention in disease by
modulation of Tissue factor activity. 

The Oxford BioMedica clone p1O20 represents VEGF. The protein sequence encoded by VEGF is
represented in the public databases by the accession NP_003367 and is described in this patent by Seq ID 
227. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_003376 and is described in this patent by Seq ID 228. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. Hypoxia is frequently found in
human tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian 
or breast cancer, VEGF is up-regulated in the malignant tissue as compared to adjacent normal tissue in at 
least one patient. 

The Oxford BioMedica clone p1B2 represents N-myc downstream regulated. The protein sequence
encoded by N-myc downstream regulated is represented in the public databases by the accession
NP_006087 and is described in this patent by Seq ID 229. The nucleotide sequence is represented in the
public sequence databases by the accession NM_006096 and is described in this patent by Seq ID 230.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. N-myc downstream regulated is preferentially induced by hypoxia in mammary epithelial cells.

The Oxford BioMedica clone p1B3 represents Proline 4-hydroxylase, alpha polypeptide I. The protein
sequence encoded by Proline 4-hydroxylase, alpha polypeptide I is represented in the public databases by
the accession NP_000908 and is described in this patent by Seq ID 231. The nucleotide sequence is
represented in the public sequence databases by the accession NM_000917 and is described in this patent
by Seq ID 232. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and
cytokines are especially relevant. Proline 4-hydroxylase, alpha polypeptide I is repressed in macrophages
activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours where
macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, Proline
4-hydroxylase, alpha polypeptide I is down-regulated in the malignant tissue as compared to adjacent
normal tissue in at least one patient.

The protein sequence encoded by BCL2/adenovirus E1B-interacting protein 3 is represented in the public
databases by the accession NP_004043 and is described in this patent by Seq ID 233. The nucleotide
sequence is represented in the public sequence databases by the accession NM_004052 and is described
in this patent by Seq ID 234. Hypoxia is an important feature of several diseases, and genes that respond
to this stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. 

The Oxford BioMedica clones p1B18 and p1B19 represent Plasminogen activator inhibitor, type 1. The
protein sequence encoded by Plasminogen activator inhibitor, type 1 is represented in the public databases
by the accession NP_000593 and is described in this patent by Seq ID 235. The nucleotide sequence is 
represented in the public sequence databases by the accession NM_000602 and is described in this patent 
by Seq ID 236. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. Plasminogen activator inhibitor, type 1 is induced in macrophages 
activated by LPS and gamma interferon. Plasminogen activator inhibitor, type 1 is repressed in
macrophages activated by IL-17. TNFalpha is an inflammatory cytokine, which acts on macrophages, and
has been shown to be central to the pathophysiology and treatment of diseases including rheumatoid 
arthritis. Genes that change in expression in response to TNFalpha therefore have utility in the design of 
therapeutic, prognostic and diagnostic products for such inflammatory conditions. Plasminogen activator 
inhibitor, type 1 is induced in macrophages activated by TNFalpha. Hypoxia is frequently found in human
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast
cancer, Plasminogen activator inhibitor, type 1 is down-regulated in the malignant tissue as compared to 
adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p1N17 represents COX-2. The protein sequence encoded by COX-2 is
represented in the public databases by the accession NP_000954 and is described in this patent by Seq ID
237. The nucleotide sequence is represented in the public sequence databases by the accession
NM_000963 and is described in this patent by Seq ID 238. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. COX-2 is preferentially induced
by hypoxia in mammary epithelial cells. Macrophages are key to several diseases involving hypoxia, and
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. COX-2 is induced in macrophages activated by LPS and gamma 
interferon. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been shown to be 
central to the pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change
in expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and 
diagnostic products for such inflammatory conditions. COX-2 is induced in macrophages activated by 
TNFalpha. In view of the known role of COX-2 in prostaglandin synthesis and tumour progression, its 
induction by hypoxia has profound clinical implications, and clear utility in diagnosis and therapy. 
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found.

The Oxford BioMedica clone p1A24 represents Metallothionein 1H. The protein sequence encoded by
Metallothionein 1H is represented in the public databases by the accession NP_005942 and is described in 
this patent by Seq ID 239. The nucleotide sequence is represented in the public sequence databases by the 
accession NM_005951 and is described in this patent by Seq ID 240. Metallothioneins can act as
antioxidants and free-radical scavengers and are therefore protective against cell death in hypoxia. Hypoxia
is an important feature of several diseases, and genes that respond to this stimulus are therefore implicated 
in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Hepatocytes are the main cell type of the liver and genes which are induced in response to hypoxia in this 
cell type are relevant to development of diagnostics and therapeutics towards liver diseases involving
hypoxia, including cirrhosis. Metallothionein 1H is preferentially induced by hypoxia in hepatocytes.
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Metallothionein 
1H is induced in macrophages activated by LPS and gamma interferon. 

The protein sequence encoded by Metallothionein 1L is represented in the public databases by the
accession NP_002441 and is described in this patent by Seq ID 241. The nucleotide sequence is 
represented in the public sequence databases by the accession NM_002450 and is described in this patent 
by Seq ID 242. Metallothioneins can act as antioxidants and free-radical scavengers and are therefore
protective against cell death in hypoxia. Hypoxia is an important feature of several diseases, and genes
that respond to this stimulus are therefore implicated in the pathogenesis and have utility in the design of
therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone p1B1 represents Metallothionein 1G. The protein sequence encoded by
Metallothionein 1G is represented in the public databases by the accession NP_005941 and is described in 
this patent by Seq ID 243. The nucleotide sequence is represented in the public sequence databases by the
accession NM_005950 and is described in this patent by Seq ID 244. Metallothioneins can act as
antioxidants and free-radical scavengers and are therefore protective against cell death in hypoxia. Hypoxia
is an important feature of several diseases, and genes that respond to this stimulus are therefore implicated 
in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
HIF1alpha and EPAS1 are transcription factors which mediate the response to hypoxia of several genes,
and have themselves been implicated in specific diseases. By adenoviral over-expression of HIF1alpha
we show augmentation of the hypoxic induction of certain genes, further confirming their status as
responsive to hypoxia. Metallothionein 1G has been shown to be induced by hypoxia to a greater degree
following adenoviral over-expression of HIF1alpha. Hepatocytes are the main cell type of the liver and
genes which are induced in response to hypoxia in this cell type are relevant to development of
diagnostics and therapeutics towards liver diseases involving hypoxia, including cirrhosis. 
Metallothionein 1G is preferentially induced by hypoxia in hepatocytes. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. Metallothionein 1G is
induced in macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human 
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast 
cancer, Metallothionein 1G is up-regulated in the malignant tissue as compared to adjacent normal tissue 
in at least one patient. 

20 The protein sequence encoded by Metallothionein IE (functional) is not represented in the public 
databases by a protein accession. The nucleotide sequence is represented in the public sequence databases 
by the accession AA872383 and is described in this patent by Seq ID 246. Metallothioneins can act as
antioxidants and free-radical scavengers and are therefore protective against cell death in hypoxia. Hypoxia
is an important feature of several diseases, and genes that respond to this stimulus are therefore implicated
in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clones p1A1, p1A2, p1A3 and p1A4 represent SLC2A3. The protein sequence
encoded by SLC2A3 is represented in the public databases by the accession NP_008862 and is described
in this patent by Seq ID 247. The nucleotide sequence is represented in the public sequence databases by 
the accession NM_006931 and is described in this patent by Seq ID 248. Hypoxia is an important feature 
of several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis
and have utility in the design of therapeutic, prognostic and diagnostic products. SLC2A3 is induced in 
macrophages treated with the inhibitory cytokine IL-10. TNFalpha is an inflammatory cytokine, which 
acts on macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions.
SLC2A3 is induced in macrophages activated by TNFalpha. 

The Oxford BioMedica clones p1A15, p1A16, p1A17 and p1A18 represent Hexokinase-2. The protein
sequence encoded by Hexokinase-2 is represented in the public databases by the accession NP_000180
and is described in this patent by Seq ID 249. The nucleotide sequence is represented in the public
sequence databases by the accession NM_000189 and is described in this patent by Seq ID 250. Hypoxia 
is an important feature of several diseases, and genes that respond to this stimulus are therefore implicated 
in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Hexokinase-2 is
repressed in macrophages activated by LPS and gamma interferon. 

The Oxford BioMedica clones p1B14, p1B15 and p1B16 represent Interleukin 8. The protein sequence
encoded by Interleukin 8 is represented in the public databases by the accession NP_000575 and is
described in this patent by Seq ID 251. The nucleotide sequence is represented in the public sequence
databases by the accession NM_000584 and is described in this patent by Seq ID 252. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Interleukin 8 is 
induced in macrophages activated by LPS and gamma interferon and is also induced in macrophages 
activated by IL-17. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been 
shown to be central to the pathophysiology and treatment of diseases including rheumatoid arthritis.
Genes that change in expression in response to TNFalpha therefore have utility in the design of
therapeutic, prognostic and diagnostic products for such inflammatory conditions. Interleukin 8 is induced
in macrophages activated by TNFalpha. Hypoxia is frequently found in human tumours where 
macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 
Interleukin 8 is down-regulated in the malignant tissue as compared to adjacent normal tissue in at least
one patient.

The Oxford BioMedica clones p1A11 and p1A12 represent GAPDH. The protein sequence encoded by
GAPDH is represented in the public databases by the accession NP_002037 and is described in this patent 
by Seq ID 253. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_002046 and is described in this patent by Seq ID 254. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. GAPDH is repressed in
macrophages activated by LPS and gamma interferon but is induced in macrophages activated by
IL-17 or IL-15. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been shown to
be central to the pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that 
change in expression in response to TNFalpha therefore have utility in the design of therapeutic, 
prognostic and diagnostic products for such inflammatory conditions. GAPDH is induced in macrophages
activated by TNFalpha. Hypoxia is frequently found in human tumours where macrophage infiltrates are 
also found. In a series of 5 patients with either ovarian or breast cancer, GAPDH is up-regulated in the 
malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p1A13 represents Phosphoglycerate kinase 1. The protein sequence
encoded by Phosphoglycerate kinase 1 is represented in the public databases by the accession NP_000282
and is described in this patent by Seq ID 255. The nucleotide sequence is represented in the public 
sequence databases by the accession NM_000291 and is described in this patent by Seq ID 256. Hypoxia 
is an important feature of several diseases, and genes that respond to this stimulus are therefore implicated 
in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products.
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. 
Phosphoglycerate kinase 1 is repressed in macrophages activated by LPS and gamma interferon. 
TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been shown to be central to 
the pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change in
expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and 
diagnostic products for such inflammatory conditions. Phosphoglycerate kinase 1 is induced in 
macrophages activated by TNFalpha. 

The Oxford BioMedica clone p1A14 represents Enolase 1. The protein sequence encoded by Enolase 1 is
represented in the public databases by the accession NP_001419 and is described in this patent by Seq ID
257. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_001428 and is described in this patent by Seq ID 258. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. Enolase 1 is repressed in
macrophages activated by LPS and gamma interferon. TNFalpha is an inflammatory cytokine, which acts
on macrophages, and has been shown to be central to the pathophysiology and treatment of diseases
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions. 
Enolase 1 is induced in macrophages activated by TNFalpha. Hypoxia is frequently found in human 
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast 
cancer, Enolase 1 is up-regulated in the malignant tissue as compared to adjacent normal tissue in at least
one patient. 

The Oxford BioMedica clone p1A19 represents Aldolase C. The protein sequence encoded by Aldolase C
is represented in the public databases by the accession NP_005156 and is described in this patent by Seq
ID 259. The nucleotide sequence is represented in the public sequence databases by the accession
NM_005165 and is described in this patent by Seq ID 260. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. Aldolase C is induced in
macrophages activated by IL-15. Aldolase C is repressed in macrophages activated by IL-15. TNFalpha is 
an inflammatory cytokine, which acts on macrophages, and has been shown to be central to the 
pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change in 
expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and
diagnostic products for such inflammatory conditions. Aldolase C is induced in macrophages activated by
TNFalpha. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. 
In a series of 5 patients with either ovarian or breast cancer, Aldolase C is up-regulated in the malignant
tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p1A20 represents Triosephosphate isomerase 1. The protein sequence
encoded by Triosephosphate isomerase 1 is represented in the public databases by the accession
NP_000356 and is described in this patent by Seq ID 261. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_000365 and is described in this patent by Seq ID 262. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Triosephosphate isomerase 1 is repressed in macrophages activated by LPS and gamma 
interferon and is also repressed in macrophages activated by IL-15. TNFalpha is an inflammatory
cytokine, which acts on macrophages, and has been shown to be central to the pathophysiology and 
treatment of diseases including rheumatoid arthritis. Genes that change in expression in response to 
TNFalpha therefore have utility in the design of therapeutic, prognostic and diagnostic products for such 
inflammatory conditions. Triosephosphate isomerase 1 is induced in macrophages activated by TNFalpha. 

The Oxford BioMedica clone p1A23 represents Metallothionein 2A. The protein sequence encoded by
Metallothionein 2A is represented in the public databases by the accession NP_005944 and is described in 
this patent by Seq ID 265. The nucleotide sequence is represented in the public sequence databases by the 
accession NM_005953 and is described in this patent by Seq ID 266. Metallothioneins can act as
antioxidants and free-radical scavengers and are therefore protective against cell death in hypoxia. Hypoxia
is an important feature of several diseases, and genes that respond to this stimulus are therefore implicated
in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
HIF1alpha and EPAS1 are transcription factors which mediate the response to hypoxia of several genes,
and have themselves been implicated in specific diseases. By adenoviral over-expression of HIF1alpha
we show augmentation of the hypoxic induction of certain genes, further confirming their status as
responsive to hypoxia. Metallothionein 2A has been shown to be induced by hypoxia to a greater degree
following adenoviral over-expression of HIF1alpha. Hepatocytes are the main cell type of the liver and
genes which are induced in response to hypoxia in this cell type are relevant to development of
diagnostics and therapeutics towards liver diseases involving hypoxia, including cirrhosis.
Metallothionein 2A is preferentially induced by hypoxia in hepatocytes. Macrophages are key to several
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. Metallothionein 2A is 
induced in macrophages activated by LPS and gamma interferon and also induced in macrophages 
activated by IL-15. Hypoxia is frequently found in human tumours where macrophage infiltrates are also
found. In a series of 5 patients with either ovarian or breast cancer, Metallothionein 2A is down-regulated
in the malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clones p1B20 and p1B21 represent Osteopontin. The protein sequence encoded
by Osteopontin is represented in the public databases by the accession NP_000573 and is described in this 
patent by Seq ID 267. The nucleotide sequence is represented in the public sequence databases by the 
accession NM_000582 and is described in this patent by Seq ID 268. Hypoxia is an important feature of
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. There is a prejudice in the art 
that the response to hypoxia is generic to all cell types. Contrary to this, we show that genes are regulated 
by hypoxia to a greater degree in certain cell types, substantiating their utility in designing specific
therapeutic products for diseases involving those cell types. Monocytes and macrophages have been
implicated in the following diseases involving hypoxia: rheumatoid arthritis, atherosclerosis, cancer and
COPD. Osteopontin is preferentially induced by hypoxia in monocytes or macrophages and a restricted
number of other cell types. Osteopontin has been shown recently to play a role in autoimmune disease
[Chabas et al, 2001, Science 294:1731-5]. We present a new association between the hypoxic response
and autoimmune disease. Macrophages are key to several diseases involving hypoxia, and contribute to 
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been
shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are
especially relevant. Osteopontin is repressed in macrophages activated by LPS and gamma interferon.
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of
5 patients with either ovarian or breast cancer, Osteopontin is up-regulated in the malignant tissue as 
compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clones p1C17 and p1C18 represent Granulin. The protein sequence encoded by
Granulin is represented in the public databases by the accession NP_002078 and is described in this
patent by Seq ID 269. The nucleotide sequence is represented in the public sequence databases by the
accession NM_002087 and is described in this patent by Seq ID 270. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. Granulin is repressed in 
macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours 
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 
Granulin is up-regulated in the malignant tissue as compared to adjacent normal tissue in at least one
patient. The up-regulation of Granulin, which is a known growth factor, is a clinically significant feature
of the hypoxic response with clear diagnostic and therapeutic utility. 

The Oxford BioMedica clone p1D8 represents Hypoxia-inducible protein 2. The protein sequence
encoded by Hypoxia-inducible protein 2 is represented in the public databases by the accession 
NP_037464 and is described in this patent by Seq ID 271. The nucleotide sequence is represented in the
public sequence databases by the accession NM_013332 and is described in this patent by Seq ID 272.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Hypoxia-inducible protein 2 is induced in macrophages treated with the inhibitory cytokine
IL-10. Hypoxia-inducible protein 2 is repressed in macrophages activated by IL-17 and is also repressed in
macrophages activated by IL-15. 

The Oxford BioMedica clone p1A10 represents Enolase 2. The protein sequence encoded by Enolase 2 is
represented in the public databases by the accession NP_001966 and is described in this patent by Seq ID
273. The nucleotide sequence is represented in the public sequence databases by the accession
NM_001975 and is described in this patent by Seq ID 274. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. Enolase 2 is repressed in
macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours 
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 
Enolase 2 is up-regulated in the malignant tissue as compared to adjacent normal tissue in at least one 
patient. We expect it to have an anti-inflammatory role. 

The Oxford BioMedica clone p1G24 represents Glycogen synthase 1. The protein sequence encoded by
Glycogen synthase 1 is represented in the public databases by the accession NP_002094 and is described 
in this patent by Seq ID 275. The nucleotide sequence is represented in the public sequence databases by 
the accession NM_002103 and is described in this patent by Seq ID 276. Hypoxia is an important feature 
of several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis
and have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. Glycogen synthase 1 is 
repressed in macrophages activated by IL-17 and is also repressed in macrophages activated by IL-15. 

The Oxford BioMedica clone p1G23 represents ALCAM. The protein sequence encoded by ALCAM is
represented in the public databases by the accession NP_001618 and is described in this patent by Seq ID 
277. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_001627 and is described in this patent by Seq ID 278. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. In view of the recently-discovered
role of ALCAM in angiogenesis [Ohneda et al, 2001, Blood 98(7):2134-42], our
demonstration of hypoxic regulation of ALCAM has great clinical significance in the treatment and 
diagnosis of vascular disease and cancer. 

The Oxford BioMedica clone p1G7 represents an unannotated EST. The protein sequence encoded by this
EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession BC008022 and is described in this patent 
by Seq ID 282. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. The EST represented by Seq ID 282 is repressed in macrophages 
activated by LPS and gamma interferon. We expect the product of the EST represented by Seq ID 282 to
have an anti-inflammatory role.

The Oxford BioMedica clone p2A23 represents Chitinase 3-like 2. The protein sequence encoded by 
Chitinase 3-like 2 is represented in the public databases by the accession NP_003991 and is described in 
this patent by Seq ID 283. The nucleotide sequence is represented in the public sequence databases by the 
accession NM_004000 and is described in this patent by Seq ID 284. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and
have utility in the design of therapeutic, prognostic and diagnostic products. Hypoxia is frequently found 
in human tumours where macrophage infiltrates are also found. In a series of 5 patients with either 
ovarian or breast cancer, Chitinase 3-like 2 is down-regulated in the malignant tissue as compared to 
adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p1G1 represents BACH1. The protein sequence encoded by BACH1 is
represented in the public databases by the accession NP_001177 and is described in this patent by Seq ID
285. The nucleotide sequence is represented in the public sequence databases by the accession
NM_001186 and is described in this patent by Seq ID 286. BACH1, a novel helicase-like protein,
interacts directly with BRCA1 and contributes to its DNA repair function. Hypoxia is an important
feature of several diseases, and genes that respond to this stimulus are therefore implicated in the
pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. The 
induction by hypoxia of this known transcriptional repressor and potential oncogene [Cantor et al 2001, 
Cell 105:149-60] is a very significant finding with profound implications for the diagnosis and treatment 
of cancer. 
The Oxford BioMedica clone p1G15 represents Phosphoglucomutase 1. The protein sequence encoded by
Phosphoglucomutase 1 is represented in the public databases by the accession NP_002624 and is 
described in this patent by Seq ID 287. The nucleotide sequence is represented in the public sequence 
databases by the accession NM_002633 and is described in this patent by Seq ID 288. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Phosphoglucomutase 1 is induced in macrophages treated with the inhibitory cytokine IL-10. 

The Oxford BioMedica clone p1F23 represents Hypothetical protein LOC51014. The protein sequence
encoded by Hypothetical protein LOC51014 is represented in the public databases by the accession
Q9Y3B3 and is described in this patent by Seq ID 289. The nucleotide sequence is represented in the
public sequence databases by the accession AF151867 and is described in this patent by Seq ID 290. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In
a series of 5 patients with either ovarian or breast cancer, Hypothetical protein LOC51014 is
down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1G8 represents Sin3-associated polypeptide. The protein sequence
encoded by Sin3-associated polypeptide is represented in the public databases by the accession 
NP_003855 and is described in this patent by Seq ID 291. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_003864 and is described in this patent by Seq ID 292.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone p1G13 represents ABCA1. The protein sequence encoded by ABCA1 is
represented in the public databases by the accession NP_005493 and is described in this patent by Seq ID
293. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_005502 and is described in this patent by Seq ID 294. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. ABCA1 is repressed in
macrophages activated by LPS and gamma interferon. The hypoxia induction of ABCA1, which is known
to be relevant to atherosclerosis [Kielar et al 2001, Clin Chem 47:2089-97], has profound implications for
the diagnosis and treatment of this disease. 

The Oxford BioMedica clone p1G10 represents SEC24 member A. The protein sequence encoded by
SEC24 member A is represented in the public databases by the accession CAA10334 and is described in 
this patent by Seq ID 295. The nucleotide sequence is represented in the public sequence databases by the
accession AJ131244 and is described in this patent by Seq ID 296. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and
have utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone p1F24 represents Glia-derived nexin. The protein sequence encoded by
Glia-derived nexin is represented in the public databases by the accession AAA35883 and is described in
this patent by Seq ID 297. The nucleotide sequence is represented in the public sequence databases by the 
accession M17783 and is described in this patent by Seq ID 298. Glia-derived nexin is a glycoprotein that
functions as a serine protease inhibitor with activity towards thrombin, trypsin and urokinase. It is known 
to play a role in neuro-degeneration [Seidel et al 1998, Brain Res Mol Brain Res 60:296-300]. Thus the 
hypoxia induction of this gene is highly significant for the diagnosis and treatment of neurodegenerative
disease. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are 
therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. Macrophages are key to several diseases involving hypoxia, and contribute to 
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been 
shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are
especially relevant. Glia-derived nexin is induced in macrophages activated by LPS and gamma 
interferon. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In 
a series of 5 patients with either ovarian or breast cancer, Glia-derived nexin is down-regulated in the 
malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p1G2 represents Postsynaptic density-95. The protein sequence encoded by
Postsynaptic density-95 is represented in the public databases by the accession NP_001356 and is
described in this patent by Seq ID 299. The nucleotide sequence is represented in the public sequence 
databases by the accession NM_001365 and is described in this patent by Seq ID 300. Postsynaptic 
density-95 belongs to the MAGUK family of cell junction proteins. Hypoxia is an important feature of
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and
have utility in the design of therapeutic, prognostic and diagnostic products. The recent demonstration of
a possible role for Postsynaptic density-95 in ischaemic pre-conditioning [Tauskela et al 2001,
Neuroscience 107:571-584] underlines the medical significance of our determination of the hypoxic
regulation of this gene, and its utility in the diagnosis and treatment of ischaemic disease. 

The Oxford BioMedica clone p1G11 represents Tumour protein D52. The protein sequence encoded by
Tumour protein D52 is represented in the public databases by the accession NP_005070 and is described
in this patent by Seq ID 301. The nucleotide sequence is represented in the public sequence databases by
the accession NM_005079 and is described in this patent by Seq ID 302. Hypoxia is an important feature 
of several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis 
and have utility in the design of therapeutic, prognostic and diagnostic products. The response of renal 
epithelial cells to hypoxia is pertinent to kidney failure, especially regarding the medullary tissue. Tumour
protein D52 is preferentially induced by hypoxia in renal epithelial cells. Hypoxia is frequently found in
human tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian
or breast cancer, Tumour protein D52 is up-regulated in the malignant tissue as compared to adjacent 
normal tissue in at least one patient. Our observation of hypoxia-regulation of this tumour-associated 
protein is highly significant for the diagnosis and treatment of cancer.

The Oxford BioMedica clone p1G16 represents p27, Kip1. The protein sequence encoded by p27, Kip1 is
represented in the public databases by the accession NP_004055 and is described in this patent by Seq ID
303. The nucleotide sequence is represented in the public sequence databases by the accession
NM_004064 and is described in this patent by Seq ID 304. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. The hypoxia regulation of this
anti-mitogen has important utility in oncology and angiogenesis [Fouty et al 2001, Am J Respir Cell Mol
Biol 25:652-658].

The Oxford BioMedica clone p1G9 represents PI-3-kinase, catalytic, beta polypeptide. The protein
sequence encoded by PI-3-kinase, catalytic, beta polypeptide is represented in the public databases by the
accession NP_006210 and is described in this patent by Seq ID 305. The nucleotide sequence is
represented in the public sequence databases by the accession NM_006219 and is described in this patent 
by Seq ID 306. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. PI-3-kinase, catalytic, beta polypeptide is repressed in macrophages 
activated by LPS and gamma interferon. The very recent publication of a role for PI3 kinase in
angiogenesis induced by hypoxic pre-conditioning [Zhu et al 2001, FEBS Lett 508:369-74] reinforces
the clinical utility which we claim for this gene as a result of its hypoxia-induction. 

The Oxford BioMedica clone p1G4 represents SLC5A3. The protein sequence encoded by SLC5A3 is
represented in the public databases by the accession AAC39548 and is described in this patent by Seq ID
307. The nucleotide sequence is represented in the public sequence databases by the accession AF027153
and is described in this patent by Seq ID 308. Hypoxia is an important feature of several diseases, and 
genes that respond to this stimulus are therefore implicated in the pathogenesis and have utility in the 
design of therapeutic, prognostic and diagnostic products. SLC5A3 is over-expressed in the brains of 
children with Down's syndrome, and may play a role in brain pathology [Berry et al 1999, J Pediatr
135:94-7]. Thus our claims of clinical utility following from hypoxia induction have great medical
significance for the diagnosis and treatment of ischaemic and degenerative disease. Macrophages are key 
to several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages 
are frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. SLC5A3 is repressed in
macrophages activated by LPS and gamma interferon.

The Oxford BioMedica clone p1G14 represents Cytohesin binding protein. The protein sequence encoded
by Cytohesin binding protein is represented in the public databases by the accession NP_004279 and is 
described in this patent by Seq ID 309. The nucleotide sequence is represented in the public sequence 
databases by the accession NM_004288 and is described in this patent by Seq ID 310. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Cytohesin has been shown to modulate PI3-kinase activity [Dierks et al 2001, J Biol Chem 276:37472-81],
reinforcing our claim here and elsewhere in this filing of the relevance to the hypoxic response of
pathways controlled by the critical second-messenger PI3. 

The Oxford BioMedica clones p1A5 and p1A6 represent SLC2A5. The protein sequence encoded by
SLC2A5 is represented in the public databases by the accession NP_003030 and is described in this 
patent by Seq ID 311. The nucleotide sequence is represented in the public sequence databases by the 
accession NM_003039 and is described in this patent by Seq ID 312. Hypoxia is an important feature of
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and
have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. SLC2A5 is repressed in 
macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer,
SLC2A5 is up-regulated in the malignant tissue as compared to adjacent normal tissue in at least one 
patient. 

The Oxford BioMedica clones p1B6, p1B7, p1B8 and p1B9 represent Adipophilin. The protein sequence
encoded by Adipophilin is represented in the public databases by the accession NP_001113 and is
described in this patent by Seq ID 313. The nucleotide sequence is represented in the public sequence
databases by the accession NM_001122 and is described in this patent by Seq ID 314. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. There is
a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary to this, we show
that genes are regulated by hypoxia to a greater degree in certain cell types, substantiating their utility in 
designing specific therapeutic products for diseases involving those cell types. Monocytes and 
macrophages have been implicated in the following diseases involving hypoxia: rheumatoid arthritis, 
atherosclerosis, cancer, COPD. The hypoxia induction of adipophilin has profound implications for the
causation, diagnosis and treatment of atherosclerosis, because this protein plays a key role in the uptake
of lipid and foam cell formation [Buechler et al 2001, Biochim Biophys Acta 1532:97-104]. Adipophilin 
is preferentially induced by hypoxia in monocytes or macrophages and a restricted number of other cell 
types. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially
relevant. Adipophilin is repressed in macrophages activated by LPS and gamma interferon and is also 
repressed in macrophages activated by IL-15. TNFalpha is an inflammatory cytokine, which acts on
macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions.
Adipophilin is induced in macrophages activated by TNFalpha. Hypoxia is frequently found in human 
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast 
cancer, Adipophilin is up-regulated and also down-regulated in the malignant tissue as compared to 
adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p1G17 represents Early development regulator 2. The protein sequence
encoded by Early development regulator 2 is represented in the public databases by the accession 
NP_004418 and is described in this patent by Seq ID 315. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_004427 and is described in this patent by Seq ID 316. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially
relevant. Early development regulator 2 is repressed in macrophages activated by LPS and gamma
interferon. 

The Oxford BioMedica clone p1G3 represents B-cell translocation gene 1. The protein sequence encoded
by B-cell translocation gene 1 is represented in the public databases by the accession NP_001722 and is 
described in this patent by Seq ID 317. The nucleotide sequence is represented in the public sequence
databases by the accession NM_001731 and is described in this patent by Seq ID 318. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. B-cell
translocation gene 1 is induced in macrophages activated by LPS and gamma interferon. Hypoxia is 
frequently found in human tumours where macrophage infiltrates are also found. In a series of 5 patients 
with either ovarian or breast cancer, B-cell translocation gene 1 is down-regulated in the malignant tissue 
as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p1F22 represents Sorting nexin 9. The protein sequence encoded by Sorting
nexin 9 is represented in the public databases by the accession NP_057308 and is described in this patent
by Seq ID 319. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_016224 and is described in this patent by Seq ID 320. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. TNFalpha is an inflammatory
cytokine, which acts on macrophages, and has been shown to be central to the pathophysiology and
treatment of diseases including rheumatoid arthritis. Genes that change in expression in response to
TNFalpha therefore have utility in the design of therapeutic, prognostic and diagnostic products for such 
inflammatory conditions. Sorting nexin 9 is induced in macrophages activated by TNFalpha. 

The Oxford BioMedica clone p1G12 represents Cyclin G2. The protein sequence encoded by Cyclin G2
is represented in the public databases by the accession NP_004345 and is described in this patent by Seq
ID 321. The nucleotide sequence is represented in the public sequence databases by the accession
NM_004354 and is described in this patent by Seq ID 322. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. Cyclin G2 is repressed in 
macrophages activated by LPS and gamma interferon. 

The Oxford BioMedica clone p1F11 represents Hypothetical protein LOC51754. The protein sequence
encoded by Hypothetical protein LOC51754 is represented in the public databases by the accession
XP_049657 and is described in this patent by Seq ID 323. The nucleotide sequence is represented in the 
public sequence databases by the accession AL137430 and is described in this patent by Seq ID 324. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Hypothetical protein LOC51754 is repressed in macrophages activated by LPS and gamma 
interferon. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In 
a series of 5 patients with either ovarian or breast cancer, Hypothetical protein LOC51754 is up-regulated
in the malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p1F14 represents Butyrate response factor 1. The protein sequence encoded
by Butyrate response factor 1 is represented in the public databases by the accession NP_004917 and is 
described in this patent by Seq ID 327. The nucleotide sequence is represented in the public sequence
databases by the accession NM_004926 and is described in this patent by Seq ID 328. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
HIF1alpha and EPAS1 are transcription factors which mediate the response to hypoxia of several genes,
and have themselves been implicated in specific diseases. By adenoviral over-expression of EPAS1 we
show augmentation of the hypoxic induction of certain genes, further confirming their status as
responsive to hypoxia. Butyrate response factor 1 has been shown to be induced by hypoxia to a greater 
degree following adenoviral over-expression of EPAS1. Hypoxia is frequently found in human tumours 
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer,
Butyrate response factor 1 is down-regulated in the malignant tissue as compared to adjacent normal
tissue in at least one patient. 

The Oxford BioMedica clone p1F17 represents P8 protein (candidate of metastasis 1). The protein
sequence encoded by P8 protein (candidate of metastasis 1) is represented in the public databases by the
accession NP_036517 and is described in this patent by Seq ID 329. The nucleotide sequence is
represented in the public sequence databases by the accession NM_012385 and is described in this patent 
by Seq ID 330. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which
have been shown to be present at disease sites, so gene expression responses to both hypoxia and
cytokines are especially relevant. P8 protein (candidate of metastasis 1) is induced in macrophages
activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours where
macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, P8
protein (candidate of metastasis 1) is up-regulated and also down-regulated in the malignant tissue as
compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clones p1C1 and p1C2 represent CXCR4. The protein sequence encoded by
CXCR4 is represented in the public databases by the accession NP_003458 and is described in this patent
by Seq ID 331. The nucleotide sequence is represented in the public sequence databases by the accession
NM_003467 and is described in this patent by Seq ID 332. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. CXCR4 is repressed in
macrophages activated by LPS and gamma interferon. TNFalpha is an inflammatory cytokine, which acts 
on macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions.
CXCR4 is induced in macrophages activated by TNFalpha. CXCR4 may act through the PI3-K pathway.
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of 
5 patients with either ovarian or breast cancer, CXCR4 is up-regulated and also down-regulated in the 
malignant tissue as compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1F3 represents Hypothetical protein XP_017131. The protein sequence
encoded by Hypothetical protein XP_017131 is represented in the public databases by the accession 
XP_017131 and is described in this patent by Seq ID 333. The nucleotide sequence is represented in the 
public sequence databases by the accession XM_017131 and is described in this patent by Seq ID 334. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Hypothetical protein XP_017131 is repressed in macrophages activated by LPS and gamma
interferon. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In 
a series of 5 patients with either ovarian or breast cancer, Hypothetical protein XP_017131 is down- 
regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p1F20 represents Proline-rich protein with nuclear targeting signal. The
protein sequence encoded by Proline-rich protein with nuclear targeting signal is represented in the public
databases by the accession NP_006804 and is described in this patent by Seq ID 335. The nucleotide 
sequence is represented in the public sequence databases by the accession NM_006813 and is described 
in this patent by Seq ID 336. Hypoxia is an important feature of several diseases, and genes that respond
to this stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. TNFalpha is an inflammatory cytokine, which acts on macrophages, 
and has been shown to be central to the pathophysiology and treatment of diseases including rheumatoid
arthritis. Genes that change in expression in response to TNFalpha therefore have utility in the design of
therapeutic, prognostic and diagnostic products for such inflammatory conditions. Proline-rich protein 
with nuclear targeting signal is induced in macrophages activated by TNFalpha. Hypoxia is frequently 
found in human tumours where macrophage infiltrates are also found. In a series of 5 patients with either 
ovarian or breast cancer, Proline-rich protein with nuclear targeting signal is down-regulated in the
malignant tissue as compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1F6 represents Hypothetical protein hqp0376. The protein sequence
encoded by Hypothetical protein hqp0376 is represented in the public databases by the accession T08745 
and is described in this patent by Seq ID 337. The nucleotide sequence is represented in the public 
sequence databases by the accession AF078844 and is described in this patent by Seq ID 338.
Hypothetical protein hqp0376 is a putative dead box protein as it shows high structural similarity to dead 
box proteins and yeast initiation factor 4A. Hypoxia is an important feature of several diseases, and genes 
that respond to this stimulus are therefore implicated in the pathogenesis and have utility in the design of 
therapeutic, prognostic and diagnostic products. HIF1alpha and EPAS1 are transcription factors which
mediate the response to hypoxia of several genes, and have themselves been implicated in specific
diseases. By adenoviral over-expression of HIF1alpha we show augmentation of the hypoxic induction of
certain genes, further confirming their status as responsive to hypoxia. Hypothetical protein hqp0376 has 
been shown to be induced by hypoxia to a greater degree following adenoviral over-expression of 
HIF1alpha. Hepatocytes are the main cell type of the liver and genes which are induced in response to
hypoxia in this cell type are relevant to development of diagnostics and therapeutics towards liver
diseases involving hypoxia, including cirrhosis. Hypothetical protein hqp0376 is preferentially induced by
hypoxia in hepatocytes. Macrophages are key to several diseases involving hypoxia, and contribute to 
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been 
shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are
especially relevant. Hypothetical protein hqp0376 is induced in macrophages activated by LPS and
gamma interferon. 

The Oxford BioMedica clone p1F4 represents CYP1. The protein sequence encoded by CYP1 is
represented in the public databases by the accession NP_000776 and is described in this patent by Seq ID 
339. The nucleotide sequence is represented in the public sequence databases by the accession
NM_000785 and is described in this patent by Seq ID 340. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. There is a prejudice in the art that 
the response to hypoxia is generic to all cell types. Contrary to this, we show that genes are regulated by 
hypoxia to a greater degree in certain cell types, substantiating their utility in designing specific
therapeutic products for diseases involving those cell types. Monocytes and macrophages have been
implicated in the following diseases involving hypoxia: rheumatoid arthritis, atherosclerosis, cancer, 
COPD. CYP1 is preferentially induced by hypoxia in monocytes or macrophages. Macrophages are key 
to several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages 
are frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. CYP1 is induced in
macrophages activated by LPS and gamma interferon. TNFalpha is an inflammatory cytokine, which acts
on macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions. 
CYP1 is induced in macrophages activated by TNFalpha. 

The Oxford BioMedica clone p1F15 represents SHB adaptor protein. The protein sequence encoded by
SHB adaptor protein is represented in the public databases by the accession NP_003019 and is described
in this patent by Seq ID 341. The nucleotide sequence is represented in the public sequence databases by
the accession NM_003028 and is described in this patent by Seq ID 342. Hypoxia is an important feature 
of several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis 
and have utility in the design of therapeutic, prognostic and diagnostic products. SHB adaptor protein 
participates in tyrosine kinase-mediated signalling and the regulation of angiogenesis and apoptosis
[Dixelius J. 2000, Blood 95:3403-11]. Our surprising observation of the hypoxia regulation of this protein
has clear medical utility in the diagnosis and treatment of vascular disease and cancer. Macrophages are 
key to several diseases involving hypoxia, and contribute to inflammatory processes. In these, 
macrophages are frequently activated by cytokines, which have been shown to be present at disease sites, 
so gene expression responses to both hypoxia and cytokines are especially relevant. SHB adaptor protein
is repressed in macrophages activated by LPS and gamma interferon.

The Oxford BioMedica clone p1F13 represents Papillomavirus regulatory factor PRF-1. The protein
sequence encoded by Papillomavirus regulatory factor PRF-1 is represented in the public databases by the
accession NP_061130 and is described in this patent by Seq ID 343. The nucleotide sequence is
represented in the public sequence databases by the accession AK023418 and is described in this patent
by Seq ID 344. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and
cytokines are especially relevant. Papillomavirus regulatory factor PRF-1 is repressed in macrophages
activated by LPS and gamma interferon. 

The Oxford BioMedica clone p1A7 represents SLC31A2. The protein sequence encoded by SLC31A2 is
represented in the public databases by the accession NP_001851 and is described in this patent by Seq ID 
345. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_001860 and is described in this patent by Seq ID 346. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. SLC31A2 is induced in
macrophages activated by LPS and gamma interferon. 

The Oxford BioMedica clone p1A21 represents UDP-glucose pyrophosphorylase 2. The protein sequence
encoded by UDP-glucose pyrophosphorylase 2 is represented in the public databases by the accession
NP_006750 and is described in this patent by Seq ID 347. The nucleotide sequence is represented in the
public sequence databases by the accession NM_006759 and is described in this patent by Seq ID 348. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clones p1B4 and p1B5 represent Proline 4-hydroxylase, alpha polypeptide II. The
protein sequence encoded by Proline 4-hydroxylase, alpha polypeptide II is represented in the public 
databases by the accession NP_004190 and is described in this patent by Seq ID 349. The nucleotide 
sequence is represented in the public sequence databases by the accession NM_004199 and is described 
in this patent by Seq ID 350. Hypoxia is an important feature of several diseases, and genes that respond
to this stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. Proline 4-hydroxylase, alpha polypeptide II is repressed in macrophages
activated by LPS and gamma interferon and is also repressed in macrophages activated by IL-15.
TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been shown to be central to 
the pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change in 
expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and 
diagnostic products for such inflammatory conditions. Proline 4-hydroxylase, alpha polypeptide II is
induced in macrophages activated by TNFalpha. Hypoxia is frequently found in human tumours where
macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, Proline 
4-hydroxylase, alpha polypeptide II is up-regulated in the malignant tissue as compared to adjacent 
normal tissue in at least one patient. 

The Oxford BioMedica clones p1B10, p1B11 and p1B12 represent Stearoyl-CoA desaturase. The protein
sequence encoded by Stearoyl-CoA desaturase is represented in the public databases by the accession
NP_005054 and is described in this patent by Seq ID 351. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_005063 and is described in this patent by Seq ID 352. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been shown to be 
central to the pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change
in expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and 
diagnostic products for such inflammatory conditions. Stearoyl-CoA desaturase is induced in 
macrophages activated by TNFalpha. 

The Oxford BioMedica clone p1B13 represents Diacylglycerol kinase, zeta. The protein sequence
encoded by Diacylglycerol kinase, zeta is represented in the public databases by the accession
NP_003637 and is described in this patent by Seq ID 353. The nucleotide sequence is represented in the
public sequence databases by the accession NM_003646 and is described in this patent by Seq ID 354. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products.

The Oxford BioMedica clone p1B22 represents Protease, serine, 11. The protein sequence encoded by
Protease, serine, 11 is represented in the public databases by the accession NP_002766 and is described in
this patent by Seq ID 355. The nucleotide sequence is represented in the public sequence databases by the 
accession NM_002775 and is described in this patent by Seq ID 356. Hypoxia is an important feature of
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and
have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. Protease, serine, 11 is
repressed in macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast 
cancer, Protease, serine, 11 is down-regulated in the malignant tissue as compared to adjacent normal 
tissue in at least one patient. 

The Oxford BioMedica clone p1B23 represents Interleukin 1 receptor antagonist. The protein sequence
encoded by Interleukin 1 receptor antagonist is represented in the public databases by the accession
NP_000568 and is described in this patent by Seq ID 357. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_000577 and is described in this patent by Seq ID 358. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. There is a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary
to this, we show that genes are regulated by hypoxia to a greater degree in certain cell types, 
substantiating their utility in designing specific therapeutic products for diseases involving those cell 
types. Monocytes and macrophages have been implicated in the following diseases involving hypoxia: 
rheumatoid arthritis, atherosclerosis, cancer, COPD. Interleukin 1 receptor antagonist is preferentially
induced by hypoxia in monocytes or macrophages. TNFalpha is an inflammatory cytokine, which acts on 
macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions.
Interleukin 1 receptor antagonist is induced in macrophages activated by TNFalpha.

The Oxford BioMedica clone p1B24 represents NS1-binding protein. The protein sequence encoded by
NS1-binding protein is represented in the public databases by the accession NP_006460 and is described
in this patent by Seq ID 359. The nucleotide sequence is represented in the public sequence databases by 
the accession NM_006469 and is described in this patent by Seq ID 360. Hypoxia is an important feature 
of several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis
and have utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone p1C3 represents Activin A receptor, type I. The protein sequence encoded
by Activin A receptor, type I is represented in the public databases by the accession NP_001096 and is 
described in this patent by Seq ID 361. The nucleotide sequence is represented in the public sequence
databases by the accession NM_001105 and is described in this patent by Seq ID 362. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. Activin 
A is known to induce apoptosis [Hughes et al 1999, Prog Neurobiol 57:421-50], and so the regulation of 
its receptor by hypoxia has clear clinical significance. Hypoxia is frequently found in human tumours
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer,
Activin A receptor, type I is down-regulated in the malignant tissue as compared to adjacent normal tissue 
in at least one patient. 

The Oxford BioMedica clone p1C4 represents FGF receptor activating protein 1. The protein sequence
encoded by FGF receptor activating protein 1 is represented in the public databases by the accession
NP_055304 and is described in this patent by Seq ID 363. The nucleotide sequence is represented in the
public sequence databases by the accession NM_014489 and is described in this patent by Seq ID 364. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. FGF has been shown to enhance survival of cardiac cells after ischaemic insult [Sheikh et al 
2001, Am J Physiol Heart Circ Physiol 280:H1039-50], and so our observation of the hypoxia-regulation
of the FGF receptor activating protein 1 is highly significant for the diagnosis and treatment of ischaemic 
disease. FGF receptor activating protein 1 is induced in macrophages treated with the inhibitory cytokine 
IL-10. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a
series of 5 patients with either ovarian or breast cancer, FGF receptor activating protein 1 is up-regulated
in the malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p1C5 represents Galectin 8. The protein sequence encoded by Galectin 8 is
represented in the public databases by the accession NP_006490 and is described in this patent by Seq ID 
365. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_006499 and is described in this patent by Seq ID 366. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Galectin 8 is an important tumour 
marker [for review see Bidon et al 2001, Int J Mol Med 8:245-50], and so its hypoxia-regulation is highly 
significant clinically. 

The Oxford BioMedica clone p1C6 represents Glucose phosphate isomerase. The protein sequence
encoded by Glucose phosphate isomerase is represented in the public databases by the accession 
NP_000166 and is described in this patent by Seq ID 367. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_000175 and is described in this patent by Seq ID 368. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Glucose phosphate isomerase is induced in macrophages activated by IL-17 and also induced in
macrophages activated by IL-15. TNFalpha is an inflammatory cytokine, which acts on macrophages, and
has been shown to be central to the pathophysiology and treatment of diseases including rheumatoid 
arthritis. Genes that change in expression in response to TNFalpha therefore have utility in the design of 
therapeutic, prognostic and diagnostic products for such inflammatory conditions. Glucose phosphate
isomerase is induced in macrophages activated by TNFalpha. Hypoxia is frequently found in human
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast
cancer, Glucose phosphate isomerase is up-regulated in the malignant tissue as compared to adjacent 
normal tissue in at least one patient.

The Oxford BioMedica clone p1C7 represents D123. The protein sequence encoded by D123 is
represented in the public databases by the accession NP_006014 and is described in this patent by Seq ID 
369. The nucleotide sequence is represented in the public sequence databases by the accession
NM_006023 and is described in this patent by Seq ID 370. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. D123 is repressed in
macrophages activated by LPS and gamma interferon and is also repressed in macrophages activated by
IL-15. D123 protein is an important regulator of the cell cycle [Onisto et al 1998, Exp Cell Res 242:451-9].
Recently it has been shown to be regulated by modification and turnover [Okuda et al 2001, Cell
Struct Funct 26:205-14]. We have shown the hypoxia-regulation of this protein, and also of several prolyl 
hydroxylases which are known to target proteins for ubiquitination and proteasomal degradation. We
believe that concerted hypoxic control of D123 and its regulating prolyl hydroxylase is part of the means
of hypoxic regulation of cell growth and tissue re-modelling. 

The Oxford BioMedica clone p1C8 represents DEC-1. The protein sequence encoded by DEC-1 is
represented in the public databases by the accession NP_003661 and is described in this patent by Seq ID
371. The nucleotide sequence is represented in the public sequence databases by the accession
NM_003670 and is described in this patent by Seq ID 372. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. DEC-1 is a helix-loop-helix 
transcription factor, and its hypoxia-regulation is highly significant. The response of renal epithelial cells 
to hypoxia is pertinent to kidney failure, especially regarding the medullary tissue. DEC-1 is
preferentially induced by hypoxia in renal epithelial cells. Hypoxia is frequently found in human tumours
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 
DEC-1 is up-regulated and also down-regulated in the malignant tissue as compared to adjacent normal 
tissue in at least one patient. 

The Oxford BioMedica clone p1C9 represents RAB-8b protein. The protein sequence encoded by RAB-8b
protein is represented in the public databases by the accession NP_057614 and is described in this
patent by Seq ID 373. The nucleotide sequence is represented in the public sequence databases by the 
accession NM_016530 and is described in this patent by Seq ID 374. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. The hypoxia regulation of 
this small GTP-ase, which is involved in intracellular membrane trafficking, is highly significant.
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. RAB-8b protein 
is induced in macrophages activated by LPS and gamma interferon.

The Oxford BioMedica clone p1C10 represents Regulator of G-protein signalling 1. The protein sequence
encoded by Regulator of G-protein signalling 1 is represented in the public databases by the accession 
NP_002913 and is described in this patent by Seq ID 375. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_002922 and is described in this patent by Seq ID 376.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. There is a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary 
to this, we show that genes are regulated by hypoxia to a greater degree in certain cell types, 
substantiating their utility in designing specific therapeutic products for diseases involving those cell
types. Monocytes and macrophages have been implicated in the following diseases involving hypoxia:
rheumatoid arthritis, atherosclerosis, cancer, COPD. Regulator of G-protein signalling 1 is preferentially 
induced by hypoxia in monocytes or macrophages. Macrophages are key to several diseases involving 
hypoxia, and contribute to inflammatory processes. In these, macrophages are frequently activated by 
cytokines, which have been shown to be present at disease sites, so gene expression responses to both
hypoxia and cytokines are especially relevant. Regulator of G-protein signalling 1 is repressed in
macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours 
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 
Regulator of G-protein signalling 1 is down-regulated in the malignant tissue as compared to adjacent 
normal tissue in at least one patient. 

The Oxford BioMedica clone p1C11 represents Polyubiquitin. The protein sequence encoded by
Polyubiquitin is represented in the public databases by the accession BAA23632 and is described in this 
patent by Seq ID 377. The nucleotide sequence is represented in the public sequence databases by the 
accession AB009010 and is described in this patent by Seq ID 378. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and
have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. Polyubiquitin is repressed in 
macrophages activated by LPS and gamma interferon. TNFalpha is an inflammatory cytokine, which acts 
on macrophages, and has been shown to be central to the pathophysiology and treatment of diseases
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions. 
Polyubiquitin is induced in macrophages activated by TNFalpha. Hypoxia is frequently found in human
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast
cancer, Polyubiquitin is up-regulated in the malignant tissue as compared to adjacent normal tissue in at 
least one patient. 

The Oxford BioMedica clone p1C12 represents Integrin, alpha 5. The protein sequence encoded by
Integrin, alpha 5 is represented in the public databases by the accession NP_002196 and is described in
this patent by Seq ID 379. The nucleotide sequence is represented in the public sequence databases by the
accession NM_002205 and is described in this patent by Seq ID 380. Hypoxia is an important feature of
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and
have utility in the design of therapeutic, prognostic and diagnostic products. Integrin, alpha 5 may play a
role in the response to neuronal injury [King et al 2001, J Neurocytol 30:243-52]. Our observation of
hypoxia regulation of both COX-2 and integrin, alpha 5 supports the very recent suggestion that they may
both function in recovery from cardiovascular injury [Hein et al 2001, Am J Physiol Heart Circ Physiol
281:H2378-84], which is pre-figured by our claims. Integrin, alpha 5 is induced by hypoxia in mammary
epithelial cells, and may play an important role in cancer progression in that tissue through its function of 
regulating interaction with the extracellular matrix. 

The Oxford BioMedica clone p1C13 represents Jk-recombination signal binding protein. The protein
sequence encoded by Jk-recombination signal binding protein is represented in the public databases by 
the accession AAA60258 and is described in this patent by Seq ID 381. The nucleotide sequence is 
represented in the public sequence databases by the accession L07872 and is described in this patent by 
Seq ID 382. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and
diagnostic products. Macrophages are key to several diseases involving hypoxia, and contribute to 
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been 
shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are 
especially relevant. Jk-recombination signal binding protein is repressed in macrophages activated by
LPS and gamma interferon. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has
been shown to be central to the pathophysiology and treatment of diseases including rheumatoid arthritis. 
Genes that change in expression in response to TNFalpha therefore have utility in the design of 
therapeutic, prognostic and diagnostic products for such inflammatory conditions. Jk-recombination 
signal binding protein is induced in macrophages activated by TNFalpha. The important role of
Jk-recombination signal binding protein in the regulation of the immune response is thus modulated by
hypoxia, and there are potentially many ways of exploiting that modulation in the design of diagnostics 
and therapeutics. Hypoxia is frequently found in human tumours where macrophage infiltrates are also 
found. In a series of 5 patients with either ovarian or breast cancer, Jk-recombination signal binding 
protein is down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one
patient. It is of particular interest and significance, in view of the escape from immunological surveillance 
of many tumours, that Jk-recombination signal binding protein is down-regulated. 

The Oxford BioMedica clone p1C14 represents Abstrakt. The protein sequence encoded by Abstrakt is
represented in the public databases by the accession NP_057306 and is described in this patent by Seq ID
383. The nucleotide sequence is represented in the public sequence databases by the accession
NM_016222 and is described in this patent by Seq ID 384. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. Abstrakt is repressed in 
macrophages activated by IL-15. TNFalpha is an inflammatory cytokine, which acts on macrophages, and 
has been shown to be central to the pathophysiology and treatment of diseases including rheumatoid 
arthritis. Genes that change in expression in response to TNFalpha therefore have utility in the design of
therapeutic, prognostic and diagnostic products for such inflammatory conditions. Abstrakt is induced in
macrophages activated by TNFalpha. The general role of Abstrakt in the regulation of gene expression 
[Schmucker et al 2000, Mech Dev 91:189-96] implies particular significance to the recovery of cells from 
hypoxic insult. 

The Oxford BioMedica clone p1C15 represents High-mobility group protein 2. The protein sequence
encoded by High-mobility group protein 2 is represented in the public databases by the accession
NP_002120 and is described in this patent by Seq ID 385. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_002129 and is described in this patent by Seq ID 386. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products.

The Oxford BioMedica clone p1C16 represents Decidual protein induced by progesterone. The protein
sequence encoded by Decidual protein induced by progesterone is represented in the public databases by
the accession NP_008952 and is described in this patent by Seq ID 387. The nucleotide sequence is
represented in the public sequence databases by the accession NM_007021 and is described in this patent
by Seq ID 388. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Decidual protein induced by progesterone is preferentially induced by 
hypoxia in mammary epithelial cells. Human decidual cells have not been tested, but we predict that 
Decidual protein induced by progesterone is hypoxia-regulated in those cells. Macrophages are key to
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. Decidual protein induced by 
progesterone is repressed in macrophages activated by IL-17. 

The Oxford BioMedica clone p1C19 represents GM2 ganglioside activator protein. The protein sequence
encoded by GM2 ganglioside activator protein is represented in the public databases by the accession 
NP_000396 and is described in this patent by Seq ID 389. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_000405 and is described in this patent by Seq ID 390. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. There is a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary 
to this, we show that genes are regulated by hypoxia to a greater degree in certain cell types, 
substantiating their utility in designing specific therapeutic products for diseases involving those cell 
types. Monocytes and macrophages have been implicated in the following diseases involving hypoxia:
rheumatoid arthritis, atherosclerosis, cancer, COPD. GM2 ganglioside activator protein is preferentially
induced by hypoxia in monocytes or macrophages. The hypoxia-inducibility of this protein in 
macrophages is likely to be clinically very significant. It is likely to play a role in the control of 
inflammation in asthma and inflammatory bowel disease, and in lipid metabolism and 
phosphatidylinositol-mediated signalling. 

The Oxford BioMedica clone p1C20 represents CNOT8. The protein sequence encoded by CNOT8 is
represented in the public databases by the accession NP_004770 and is described in this patent by Seq ID
391. The nucleotide sequence is represented in the public sequence databases by the accession
NM_004779 and is described in this patent by Seq ID 392. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products.

The protein sequence encoded by Similar to Nucleoside phosphorylase is not represented in the public 
databases by a protein accession. The nucleotide sequence is represented in the public sequence databases 
by the accession AA430382 and is described in this patent by Seq ID 394. Hypoxia is an important 
feature of several diseases, and genes that respond to this stimulus are therefore implicated in the
pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone p1P5 represents SCYA2. The protein sequence encoded by SCYA2 is
represented in the public databases by the accession NP_002973 and is described in this patent by Seq ID
395. The nucleotide sequence is represented in the public sequence databases by the accession
NM_002982 and is described in this patent by Seq ID 396. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. SCYA2 is induced in 
macrophages activated by LPS and gamma interferon and is also induced in macrophages activated by
IL-17 or IL-15. Thus the role of SCYA2 in monocyte recruitment [Lu et al 1998, J Exp Med 187:601-8],
which has clear relevance to the diagnosis and treatment of cardiovascular disease, cancer, rheumatoid
arthritis, atherosclerosis and COPD, is enhanced by hypoxia. TNFalpha is an inflammatory cytokine,
which acts on macrophages, and has been shown to be central to the pathophysiology and treatment of 
diseases including rheumatoid arthritis. Genes that change in expression in response to TNFalpha 
therefore have utility in the design of therapeutic, prognostic and diagnostic products for such 
inflammatory conditions. SCYA2 is repressed in macrophages activated by TNFalpha. Hypoxia is
frequently found in human tumours where macrophage infiltrates are also found. In a series of 5 patients
with either ovarian or breast cancer, SCYA2 is down-regulated in the malignant tissue as compared to
adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p2L23 represents Endothelin 1. The protein sequence encoded by 
Endothelin 1 is represented in the public databases by the accession NP_001946 and is described in this
patent by Seq ID 397. The nucleotide sequence is represented in the public sequence databases by the
accession NM_001955 and is described in this patent by Seq ID 398. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to 
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. Endothelin 1 is induced in
macrophages activated by LPS and gamma interferon. TNFalpha is an inflammatory cytokine, which acts
on macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 
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utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions. 
Endothelin 1 is induced in macrophages activated by TNFalpha. Endothelin 1 plays an important role in 
inducing proliferation of vascular smooth muscle cells. Its hypoxia-inducibility, and thus its modulation to
ameliorate the consequences of ischaemic insult, is of considerable clinical significance to recovery
from injury and to angiogenesis.

The protein sequence encoded by Similar to Heat shock 70kD protein 4 is not represented in the public 
databases by a protein accession. The nucleotide sequence is represented in the public sequence databases 
by the accession AA633656 and is described in this patent by Seq ID 400. Hypoxia is an important 
feature of several diseases, and genes that respond to this stimulus are therefore implicated in the 
pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clone p1K9 represents Lipocortin I. The protein sequence encoded by Lipocortin
I is represented in the public databases by the accession NP_000691 and is described in this patent by Seq 
ID 401. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_000700 and is described in this patent by Seq ID 402. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. Lipocortin I (also called annexin I) 
is an important anti-inflammatory mediator, and its hypoxia-inducibility has important implications for 
the diagnosis and treatment of ischaemic disease, cancer, atherosclerosis, and inflammatory diseases such 
as rheumatoid arthritis. Hypoxia is frequently found in human tumours where macrophage infiltrates are
also found. In a series of 5 patients with either ovarian or breast cancer, Lipocortin I is up-regulated and
also down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1K23 represents MYC. The protein sequence encoded by MYC is
represented in the public databases by the accession NP_002458 and is described in this patent by Seq ID
403. The nucleotide sequence is represented in the public sequence databases by the accession
NM_002467 and is described in this patent by Seq ID 404. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. MYC is repressed in 
macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer,
MYC is up-regulated and also down-regulated in the malignant tissue as compared to adjacent normal 
tissue in at least one patient. 

The Oxford BioMedica clone p1K15 represents Alpha-2-macroglobulin. The protein sequence encoded
by Alpha-2-macroglobulin is represented in the public databases by the accession NP_000005 and is
described in this patent by Seq ID 405. The nucleotide sequence is represented in the public sequence
databases by the accession NM_000014 and is described in this patent by Seq ID 406. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. There is
a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary to this, we show
that genes are regulated by hypoxia to a greater degree in certain cell types, substantiating their utility in
designing specific therapeutic products for diseases involving those cell types. Monocytes and
macrophages have been implicated in the following diseases involving hypoxia: rheumatoid arthritis,
atherosclerosis, cancer, COPD. Alpha-2-macroglobulin is preferentially induced by hypoxia in monocytes
or macrophages. Hypoxia is frequently found in human tumours where macrophage infiltrates are also
found. In a series of 5 patients with either ovarian or breast cancer, Alpha-2-macroglobulin is
down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1K8 represents SCYA4. The protein sequence encoded by SCYA4 is
represented in the public databases by the accession XP_008449 and is described in this patent by Seq ID
407. The nucleotide sequence is represented in the public sequence databases by the accession
XM_008449 and is described in this patent by Seq ID 408. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. SCYA4 is induced in
macrophages activated by LPS and gamma interferon and is also induced in macrophages activated by
IL-17. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been shown to be central
to the pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change in
expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and
diagnostic products for such inflammatory conditions. SCYA4 is induced in macrophages activated by
TNFalpha. SCYA4 is a chemokine which is likely to be significant in inflammatory disease as a direct
result of its hypoxic regulation. Hypoxia is frequently found in human tumours where macrophage
infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, SCYA4 is
down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1M24 represents Sex hormone-binding globulin. The protein sequence
encoded by Sex hormone-binding globulin is represented in the public databases by the accession
NP_001031 and is described in this patent by Seq ID 409. The nucleotide sequence is represented in the
public sequence databases by the accession NM_001040 and is described in this patent by Seq ID 410. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone p1K7 represents ATP-binding cassette E1. The protein sequence encoded
by ATP-binding cassette E1 is represented in the public databases by the accession NP_002931 and is
described in this patent by Seq ID 411. The nucleotide sequence is represented in the public sequence
databases by the accession NM_002940 and is described in this patent by Seq ID 412. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products.
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. ATP-binding 
cassette E1 is repressed in macrophages activated by LPS and gamma interferon.

The Oxford BioMedica clone p1K16 represents CCT6A. The protein sequence encoded by CCT6A is
represented in the public databases by the accession NP_001753 and is described in this patent by Seq ID 
413. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_001762 and is described in this patent by Seq ID 414. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clone p1K18 represents Colony-stimulating factor 1. The protein sequence
encoded by Colony-stimulating factor 1 is represented in the public databases by the accession AAA52117
and is described in this patent by Seq ID 415. The nucleotide sequence is represented in the public
sequence databases by the accession M37435 and is described in this patent by Seq ID 416. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products.
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease
sites, so gene expression responses to both hypoxia and cytokines are especially relevant.
Colony-stimulating factor 1 is repressed in macrophages activated by LPS and gamma interferon.

The Oxford BioMedica clone p1N1 represents GA17. The protein sequence encoded by GA17 is
represented in the public databases by the accession NP_006351 and is described in this patent by Seq ID
417. The nucleotide sequence is represented in the public sequence databases by the accession
NM_006360 and is described in this patent by Seq ID 418. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone p1K22 represents GPR44. The protein sequence encoded by GPR44 is
represented in the public databases by the accession NP_004769 and is described in this patent by Seq ID
419. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_004778 and is described in this patent by Seq ID 420. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. GPR44 is repressed in 
macrophages activated by LPS and gamma interferon. GPR44 is most similar to the chemoattractant
GPCRs [Marchese et al 1999, Genomics 1999 Feb 15;56(1):12-21]. Our demonstration of its hypoxic
regulation enables prediction of roles in diseases associated with transient hypoxia and macrophages.
GPCRs are a druggable class of molecules, and represent an ideal route for pharmacological
intervention.

The Oxford BioMedica clone p1K14 represents Keratin 6B. The protein sequence encoded by Keratin 6B
is represented in the public databases by the accession NP_005546 and is described in this patent by Seq
ID 421. The nucleotide sequence is represented in the public sequence databases by the accession
NM_005555 and is described in this patent by Seq ID 422. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Keratin 6B is induced in 
macrophages treated with the inhibitory cytokine IL-10. Keratin 6B is repressed in macrophages activated
by IL-17 and is also repressed in macrophages activated by IL-15.

The Oxford BioMedica clone p1K13 represents Lymphocyte adaptor protein. The protein sequence
encoded by Lymphocyte adaptor protein is represented in the public databases by the accession
NP_005466 and is described in this patent by Seq ID 423. The nucleotide sequence is represented in the
public sequence databases by the accession NM_005475 and is described in this patent by Seq ID 424.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products.

The Oxford BioMedica clone p1J20 represents Neuro-oncological ventral antigen 1. The protein sequence
encoded by Neuro-oncological ventral antigen 1 is represented in the public databases by the accession 
NP_002506 and is described in this patent by Seq ID 425. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_002515 and is described in this patent by Seq ID 426. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In
a series of 5 patients with either ovarian or breast cancer, Neuro-oncological ventral antigen 1 is
down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1J22 represents Neutral sphingomyelinase (N-SMase) activation
associated factor. The protein sequence encoded by Neutral sphingomyelinase (N-SMase) activation
associated factor is represented in the public databases by the accession NP_003571 and is described in
this patent by Seq ID 427. The nucleotide sequence is represented in the public sequence databases by the
accession NM_003580 and is described in this patent by Seq ID 428. Hypoxia is an important feature of
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and
have utility in the design of therapeutic, prognostic and diagnostic products. Neutral sphingomyelinase
(N-SMase) activation associated factor is induced in macrophages treated with the inhibitory cytokine
IL-10. Neutral sphingomyelinase (N-SMase) activation associated factor is repressed in macrophages
activated by IL-17 and is also repressed in macrophages activated by IL-15. We expect activation of
Neutral sphingomyelinase (N-SMase) to have an anti-inflammatory effect. This enzyme is known to
modulate the sphingomyelin second messenger cycle, potentially interacting with the oxidative system.
Our demonstration of hypoxic regulation provides a crucial indication of the benefit of therapeutic 
intervention via sphingomyelinase (N-SMase) for the treatment of inflammatory diseases and diseases 
related to the hypoxic macrophage. 

The Oxford BioMedica clone p1K1 represents Cyclophilin F. The protein sequence encoded by
Cyclophilin F is represented in the public databases by the accession NP_005720 and is described in this
patent by Seq ID 429. The nucleotide sequence is represented in the public sequence databases by the 
accession NM_005729 and is described in this patent by Seq ID 430. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. Hypoxia is frequently found
in human tumours where macrophage infiltrates are also found. In a series of 5 patients with either
ovarian or breast cancer, Cyclophilin F is up-regulated in the malignant tissue as compared to adjacent 
normal tissue in at least one patient. 

The Oxford BioMedica clone p1K3 represents Pleckstrin. The protein sequence encoded by Pleckstrin is
represented in the public databases by the accession NP_002655 and is described in this patent by Seq ID
431. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_002664 and is described in this patent by Seq ID 432. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. There is a prejudice in the art that
the response to hypoxia is generic to all cell types. Contrary to this, we show that genes are regulated by
hypoxia to a greater degree in certain cell types, substantiating their utility in designing specific
therapeutic products for diseases involving those cell types. Monocytes and macrophages have been
implicated in the following diseases involving hypoxia: rheumatoid arthritis, atherosclerosis, cancer,
COPD. Pleckstrin is preferentially induced by hypoxia in monocytes or macrophages. Macrophages are
key to several diseases involving hypoxia, and contribute to inflammatory processes. In these,
macrophages are frequently activated by cytokines, which have been shown to be present at disease sites,
so gene expression responses to both hypoxia and cytokines are especially relevant. Pleckstrin is induced
in macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer,
Pleckstrin is down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one
patient.

The Oxford BioMedica clones p1J19 and p1K2 represent CFFM4. The protein sequence encoded by
CFFM4 is represented in the public databases by the accession NP_067024 and is described in this patent
by Seq ID 433. The nucleotide sequence is represented in the public sequence databases by the accession
NM_021201 and is described in this patent by Seq ID 434. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. There is a prejudice in the art that 
the response to hypoxia is generic to all cell types. Contrary to this, we show that genes are regulated by
hypoxia to a greater degree in certain cell types, substantiating their utility in designing specific
therapeutic products for diseases involving those cell types. Monocytes and macrophages have been
implicated in the following diseases involving hypoxia: rheumatoid arthritis, atherosclerosis, cancer,
COPD. CFFM4 is preferentially induced by hypoxia in monocytes or macrophages. CFFM4 is induced in
macrophages treated with the inhibitory cytokine IL-10. It has been suggested recently that CFFM4 is
associated with mature cellular function in the monocytic lineage and that it may be a component of a
receptor complex involved in signal transduction [Gingras et al 2001, Immunogenetics 53:468-76]. Our
demonstration of hypoxic regulation opens possible routes of intervention in macrophage-related disease
via this potentially important cell surface receptor. Hypoxia is frequently found in human tumours where
macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer,
CFFM4 is down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one
patient. 

The Oxford BioMedica clone p1K5 represents Ribosomal protein L36a. The protein sequence encoded by
Ribosomal protein L36a is represented in the public databases by the accession NP_000992 and is 
described in this patent by Seq ID 435. The nucleotide sequence is represented in the public sequence 
databases by the accession NM_001001 and is described in this patent by Seq ID 436. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone p1J17 represents SLC6A1. The protein sequence encoded by SLC6A1 is
represented in the public databases by the accession NP_003033 and is described in this patent by Seq ID
437. The nucleotide sequence is represented in the public sequence databases by the accession
NM_003042 and is described in this patent by Seq ID 438. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Hypoxia is frequently found in 
human tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian
or breast cancer, SLC6A1 is up-regulated and also down-regulated in the malignant tissue as compared to
adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p1J18 represents Synaptopodin. The protein sequence encoded by
Synaptopodin is represented in the public databases by the accession NP_009217 and is described in this
patent by Seq ID 439. The nucleotide sequence is represented in the public sequence databases by the
accession NM_007286 and is described in this patent by Seq ID 440. Hypoxia is an important feature of
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. Synaptopodin is a component 
of the cytoskeleton which has particular importance in neurons, where it is involved in synaptic plasticity. 
Its hypoxic regulation is potentially significant in the context of neurological disease. Hypoxia is
frequently found in human tumours where macrophage infiltrates are also found. In a series of 5 patients
with either ovarian or breast cancer, Synaptopodin is up-regulated and also down-regulated in the 
malignant tissue as compared to adjacent normal tissue in at least one patient. 




The Oxford BioMedica clone p1J15 represents TERA protein. The protein sequence encoded by TERA
protein is represented in the public databases by the accession NP_067061 and is described in this patent
by Seq ID 441. The nucleotide sequence is represented in the public sequence databases by the accession
NM_021238 and is described in this patent by Seq ID 442. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. Hypoxia is frequently found in 
human tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian
or breast cancer, TERA protein is up-regulated and also down-regulated in the malignant tissue as 
compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p1K4 represents TSC-22. The protein sequence encoded by TSC-22 is
represented in the public databases by the accession NP_006013 and is described in this patent by Seq ID
443. The nucleotide sequence is represented in the public sequence databases by the accession
NM_006022 and is described in this patent by Seq ID 444. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. TSC-22 is a transcriptional
regulator of the leucine zipper class, and its hypoxic regulation is likely to have significant downstream 
effects which may be related to ischaemic disease. Thus it may provide important points of intervention in 
such diseases. Macrophages are key to several diseases involving hypoxia, and contribute to 
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been
shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are
especially relevant. TSC-22 is repressed in macrophages activated by LPS and gamma interferon. 
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of 
5 patients with either ovarian or breast cancer, TSC-22 is down-regulated in the malignant tissue as 
compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p2A14 represents an unannotated EST. The protein sequence encoded by
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is
represented in the public sequence databases by the accession AA988110 and is described in this patent
by Seq ID 446. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products. The EST represented by Seq ID 446 is induced in macrophages
treated with the inhibitory cytokine IL-10. The EST represented by Seq ID 446 is repressed in 
macrophages activated by IL-17 and is also repressed in macrophages activated by IL-15. 

The Oxford BioMedica clone p1J23 represents Calgranulin A. The protein sequence encoded by
Calgranulin A is represented in the public databases by the accession NP_002955 and is described in this
patent by Seq ID 447. The nucleotide sequence is represented in the public sequence databases by the
accession NM_002964 and is described in this patent by Seq ID 448. Hypoxia is an important feature of
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and
have utility in the design of therapeutic, prognostic and diagnostic products. Calgranulin A, called by its
synonym S100A8, has been cited recently as "wound-regulated" [Thorey et al 2001, J Biol Chem
276:35818-25] which provides less precise support for our prior determination of its hypoxia-regulation.
In its potential role as a chemoattractant, it would be an important point of intervention for the modulation
of inflammatory processes. Macrophages are key to several diseases involving hypoxia, and contribute to
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been
shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are
especially relevant. Calgranulin A is repressed in macrophages activated by LPS and gamma interferon.
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of
5 patients with either ovarian or breast cancer, Calgranulin A is down-regulated in the malignant tissue as
compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1J21 represents Replication factor C large subunit. The protein sequence
encoded by Replication factor C large subunit is represented in the public databases by the accession 
NP_002904 and is described in this patent by Seq ID 449. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_002913 and is described in this patent by Seq ID 450. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. 

The Oxford BioMedica clone p1J24 represents Signal recognition particle 19kD. The protein sequence
encoded by Signal recognition particle 19kD is represented in the public databases by the accession
NP_003126 and is described in this patent by Seq ID 451. The nucleotide sequence is represented in the
public sequence databases by the accession NM_003135 and is described in this patent by Seq ID 452.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone p1J16 represents cDNA: FLJ23019 fis, clone LNG00916. The protein
sequence encoded by cDNA: FLJ23019 fis, clone LNG00916 is not represented in the public databases
by a protein accession. The nucleotide sequence is represented in the public sequence databases by the 
accession AK026672 and is described in this patent by Seq ID 454. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. The cDNA: FLJ23019 fis, 
clone LNG00916 is induced in macrophages activated by LPS and gamma interferon and is also induced 
in macrophages activated by IL-15.

The Oxford BioMedica clone p1J2 represents Proteasome subunit, alpha type, 4. The protein sequence
encoded by Proteasome subunit, alpha type, 4 is represented in the public databases by the accession 
NP_002780 and is described in this patent by Seq ID 455. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_002789 and is described in this patent by Seq ID 456. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone p1J9 represents MAFB. The protein sequence encoded by MAFB is
represented in the public databases by the accession NP_005452 and is described in this patent by Seq ID
457. The nucleotide sequence is represented in the public sequence databases by the accession
NM_005461 and is described in this patent by Seq ID 458. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. MAFB is a transcriptional 
regulator of the leucine zipper type, and is likely to play an important role in the mediation of the hypoxic 

20 response, with attendant relevance to associated diseases. 

The Oxford BioMedica clone p1J10 represents DNCLI2. The protein sequence encoded by DNCLI2 is
represented in the public databases by the accession NP_006132 and is described in this patent by Seq ID
459. The nucleotide sequence is represented in the public sequence databases by the accession
NM_006141 and is described in this patent by Seq ID 460. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. Hypoxia is frequently found in
human tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian
or breast cancer, DNCLI2 is down-regulated in the malignant tissue as compared to adjacent normal tissue
in at least one patient.

The Oxford BioMedica clone p1J1 represents Chromobox homolog 3. The protein sequence encoded by
Chromobox homolog 3 is represented in the public databases by the accession NP_057671 and is
described in this patent by Seq ID 461. The nucleotide sequence is represented in the public sequence
databases by the accession NM_016587 and is described in this patent by Seq ID 462. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clone p1J5 represents SCYA7. The protein sequence encoded by SCYA7 is
represented in the public databases by the accession NP_006264 and is described in this patent by Seq ID
463. The nucleotide sequence is represented in the public sequence databases by the accession
NM_006273 and is described in this patent by Seq ID 464. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. SCYA7 is induced in
macrophages activated by IL-15. SCYA7 is a chemoattractant protein which, considering its
hypoxia-regulation, is likely to play an important role in inflammatory and ischaemic disease. TNFalpha
is an inflammatory cytokine, which acts on macrophages, and has been shown to be central to the
pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change in
expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and
diagnostic products for such inflammatory conditions. SCYA7 is repressed in macrophages activated by
TNFalpha.

The Oxford BioMedica clone p1J11 represents Fatty-acid-Coenzyme A ligase, long-chain 2. The protein
sequence encoded by Fatty-acid-Coenzyme A ligase, long-chain 2 is represented in the public databases
by the accession NP_066945 and is described in this patent by Seq ID 465. The nucleotide sequence is
represented in the public sequence databases by the accession NM_021122 and is described in this patent
by Seq ID 466. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which
have been shown to be present at disease sites, so gene expression responses to both hypoxia and
cytokines are especially relevant. Fatty-acid-Coenzyme A ligase, long-chain 2 is induced in macrophages
activated by LPS and gamma interferon and also induced in macrophages activated by IL-17 or IL-15.
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of
5 patients with either ovarian or breast cancer, Fatty-acid-Coenzyme A ligase, long-chain 2 is
down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1J8 represents Programmed cell death 5. The protein sequence encoded by
Programmed cell death 5 is represented in the public databases by the accession NP_004699 and is
described in this patent by Seq ID 467. The nucleotide sequence is represented in the public sequence
databases by the accession NM_004708 and is described in this patent by Seq ID 468. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clone p1I20 represents SCYA3L. The protein sequence encoded by SCYA3L is
represented in the public databases by the accession CAA36397 and is described in this patent by Seq ID
469. The nucleotide sequence is represented in the public sequence databases by the accession X52149
and is described in this patent by Seq ID 470. Hypoxia is an important feature of several diseases, and
genes that respond to this stimulus are therefore implicated in the pathogenesis and have utility in the
design of therapeutic, prognostic and diagnostic products. There is a prejudice in the art that the response
to hypoxia is generic to all cell types. Contrary to this, we show that genes are regulated by hypoxia to a
greater degree in certain cell types, substantiating their utility in designing specific therapeutic products
for diseases involving those cell types. Monocytes and macrophages have been implicated in the
following diseases involving hypoxia: rheumatoid arthritis, atherosclerosis, cancer, COPD. SCYA3L is
preferentially induced by hypoxia in monocytes or macrophages. Macrophages are key to several diseases
involving hypoxia, and contribute to inflammatory processes. In these, macrophages are frequently
activated by cytokines, which have been shown to be present at disease sites, so gene expression
responses to both hypoxia and cytokines are especially relevant. SCYA3L is induced in macrophages
activated by LPS and gamma interferon. TNFalpha is an inflammatory cytokine, which acts on
macrophages, and has been shown to be central to the pathophysiology and treatment of diseases
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions.
SCYA3L is induced in macrophages activated by TNFalpha.

The Oxford BioMedica clone p1J3 represents Furin. The protein sequence encoded by Furin is
represented in the public databases by the accession NP_002560 and is described in this patent by Seq ID
471. The nucleotide sequence is represented in the public sequence databases by the accession
NM_002569 and is described in this patent by Seq ID 472. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products.

The Oxford BioMedica clone p1J12 represents Nuclear autoantigenic sperm protein. The protein
sequence encoded by Nuclear autoantigenic sperm protein is represented in the public databases by the
accession NP_002473 and is described in this patent by Seq ID 473. The nucleotide sequence is
represented in the public sequence databases by the accession NM_002482 and is described in this patent
by Seq ID 474. Hypoxia is an important feature of several diseases, and genes that respond to this
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic,
prognostic and diagnostic products.

The Oxford BioMedica clone p1I23 represents Ecotropic viral integration site 2A. The protein sequence
encoded by Ecotropic viral integration site 2A is represented in the public databases by the accession
NP_055025 and is described in this patent by Seq ID 475. The nucleotide sequence is represented in the
public sequence databases by the accession NM_014210 and is described in this patent by Seq ID 476.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. There is a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary
to this, we show that genes are regulated by hypoxia to a greater degree in certain cell types,
substantiating their utility in designing specific therapeutic products for diseases involving those cell
types. Monocytes and macrophages have been implicated in the following diseases involving hypoxia:
rheumatoid arthritis, atherosclerosis, cancer, COPD. Ecotropic viral integration site 2A is preferentially
induced by hypoxia in monocytes or macrophages. Macrophages are key to several diseases involving
hypoxia, and contribute to inflammatory processes. In these, macrophages are frequently activated by
cytokines, which have been shown to be present at disease sites, so gene expression responses to both
hypoxia and cytokines are especially relevant. Ecotropic viral integration site 2A is repressed in
macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer,
Ecotropic viral integration site 2A is down-regulated in the malignant tissue as compared to adjacent
normal tissue in at least one patient.

The Oxford BioMedica clone p1J7 represents Sjogren syndrome antigen B. The protein sequence
encoded by Sjogren syndrome antigen B is represented in the public databases by the accession
NP_003133 and is described in this patent by Seq ID 477. The nucleotide sequence is represented in the
public sequence databases by the accession NM_003142 and is described in this patent by Seq ID 478.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products. Sjogren syndrome antigen B is preferentially induced by hypoxia in mammary epithelial cells.
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Sjogren
syndrome antigen B is induced in macrophages activated by LPS and gamma interferon.

The Oxford BioMedica clone p1I21 represents SCYA8. The protein sequence encoded by SCYA8 is
represented in the public databases by the accession NP_005614 and is described in this patent by Seq ID
479. The nucleotide sequence is represented in the public sequence databases by the accession
NM_005623 and is described in this patent by Seq ID 480. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. SCYA8 is induced in
macrophages activated by LPS and gamma interferon and is also induced in macrophages activated by
IL-15.

The Oxford BioMedica clone p1I19 represents GRO2. The protein sequence encoded by GRO2 is
represented in the public databases by the accession NP_002080 and is described in this patent by Seq ID
481. The nucleotide sequence is represented in the public sequence databases by the accession
NM_002089 and is described in this patent by Seq ID 482. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are
frequently activated by cytokines, which have been shown to be present at disease sites, so gene
expression responses to both hypoxia and cytokines are especially relevant. GRO2 is induced in
macrophages activated by LPS and gamma interferon and is also induced in macrophages activated by
IL-17. GRO2 encodes a chemokine which is likely to be involved in the inflammatory response. Its induction
by hypoxia provides a potential route for intervention in diseases related to inflammation and ischaemia.
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of
5 patients with either ovarian or breast cancer, GRO2 is down-regulated in the malignant tissue as
compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1J4 represents Small nuclear ribonucleoprotein D1. The protein sequence
encoded by Small nuclear ribonucleoprotein D1 is represented in the public databases by the accession
NP_008869 and is described in this patent by Seq ID 483. The nucleotide sequence is represented in the
public sequence databases by the accession NM_006938 and is described in this patent by Seq ID 484.
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic
products.

The Oxford BioMedica clone p1I24 represents GRO1. The protein sequence encoded by GRO1 is
represented in the public databases by the accession NP_001502 and is described in this patent by Seq ID
485. The nucleotide sequence is represented in the public sequence databases by the accession
NM_001511 and is described in this patent by Seq ID 486. GRO1 has known chemotactic activity for
neutrophils. GRO1 belongs to the intercrine alpha family of small CXC cytokines. GRO1 encodes a
chemokine which is likely to be involved in the inflammatory response. Its induction by hypoxia provides
a potential route for intervention in diseases related to inflammation and ischaemia. Hypoxia is an
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products.
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. GRO1 is
induced in macrophages activated by LPS and gamma interferon and is also induced in macrophages
activated by IL-17. Hypoxia is frequently found in human tumours where macrophage infiltrates are also
found. In a series of 5 patients with either ovarian or breast cancer, GRO1 is down-regulated in the
malignant tissue as compared to adjacent normal tissue in at least one patient.

The Oxford BioMedica clone p1I18 represents Selectin L. The protein sequence encoded by Selectin L is
represented in the public databases by the accession NP_000646 and is described in this patent by Seq ID
487. The nucleotide sequence is represented in the public sequence databases by the accession
NM_000655 and is described in this patent by Seq ID 488. Hypoxia is an important feature of several
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have
utility in the design of therapeutic, prognostic and diagnostic products. Selectin L shedding by leucocytes
is one aspect of the induction of the inflammatory response. Hypoxic-regulation of Selectin L is clearly a
significant factor in the induction of inflammation following ischaemic insult or in diseases in which
transient ischaemic conditions occur. Modulation of this induction is one aspect of the present invention.
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of
5 patients with either ovarian or breast cancer, Selectin L is down-regulated in the malignant tissue as
compared to adjacent normal tissue in at least one patient.
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TABLES 

TABLE 1: Hypoxia-inducible genes identified from clones only derived from the cardiomyoblast library

                                            SEQ ID
GENE NAME                                   protein   nucleotide   Accession

Diacylglycerol kinase, zeta                 353       354          NM_003646
CCR4 associated factor 1                    391       392          AF053318
GM2 ganglioside activator protein           389       390          X62078
Granulin                                    269       270          AK000607
Serine protease 11                          355       356          Y07921
High mobility group 2 protein               385       386          M83665
Decidual protein induced by progesterone    387       388          NM_007021
v CiRu -dox protein aostraKi                383       384          MH ft 1 A O 1 0
IL-1 receptor antagonist                    357       358          U65590
KIAA1376 protein                            29        30           AB037797
Hypothetical protein KIAA0127               31        32           D50917
Hypothetical protein FLJ20308               33        34           AL137263
EST                                         91        92           AL390082
EST                                         89        90           AL117352
EST                                         77        78           AW664180



WO 02/46465 PCT/GB01/05458 


TABLE 2: Hypoxia-inducible genes identified from clones only derived from the macrophage libraries

                                                            SEQ ID
GENE NAME                                                   protein   nucleotide   Accession

Metallothionein-2a                                          265       266          J00271
Metallothionein-1h                                          239       240          X64177
Metallothionein-1g                                          243       244          J03910
Interleukin 8                                               251       252          Y00787
Lactate dehydrogenase A                                     223       224          NM_005566
UDP-glucose pyrophosphorylase 2                             347       348          NM_006759
Enolase 1                                                   257       258          NM_001428
Enolase 2                                                   273       274          NM_001975
Tissue factor / coagulation factor III / thromboplastin     225       226          NM_001993
proline 4-hydroxylase, alpha polypeptide 1                  231       232          NM_000917
proline 4-hydroxylase, alpha polypeptide II                 349       350          NM_004199
NS1-binding protein                                         359       360          NM_006469
FGF receptor activating protein 1                           363       364          AF159621
Adenylate kinase 3                                          263       264          NM_013410
Osteopontin                                                 267       268          X13694
Aldolase C, fructose-bisphosphate                           259       260          NM_005165
Galectin-8                                                  365       366          AF193806
Regulator of G-protein signalling 1 (BL34)                  375       376          S59049
Polyubiquitin UbC                                           377       378          AB009010
Activin A receptor type I                                   361       362          NM_001105
Glyceraldehyde-3-phosphate dehydrogenase                    253       254          NM_002046
Phosphoglycerate kinase 1                                   IKS       ZjO          NM_000291
RAB-8b                                                      373       374          NM_016530
Glucose phosphate isomerase                                 367       368          NM_000175
D123 gene product (HT1080)                                  369       370          U27112
Integrin alpha 5                                            379       380          NM_002205
Triosephosphate isomerase 1                                 261       262          NM_000365
solute carrier family 31 (copper transporters), member 2    345       346          NM_001860
Jk-recombination signal binding protein                     381       382          L07872
N-myc downstream regulated (NDRG1/RTP)                      229       230          D87953
Plasminogen activator inhibitor-1                           235       236          M16006
Dec-1                                                       371       372          NM_003670
FUSIN / CXCR4                                               331       332          NM_003467
Hypothetical protein FLJ20500                               25        26           AK000507
DKFZP564D116 protein                                        27        28           AL050022
Hypothetical protein FLJ10134                               23        24           AK000996
cDNA FLJ10433 fis NT2RP1000478                              73        74           AK001295
ESTs                                                        93        94           AW250104
ESTs                                                        95        96           BE382614
ESTs                                                        67        68           AW071063
ESTs                                                        67        68           AW964331
ESTs                                                        133       134          AA612751
Singleton EST (not in UniGene)                              135       136          AI018611



The gene entitled "Jk-recombination signal binding protein" was found to be hypoxia-inducible using
subtracted cDNA probes for hybridisation, but with non-subtracted probes, where the hybridisation is
quantitative, no signal was detected. This indicates that the gene is probably hypoxia-regulated but the
absolute expression levels are very low.
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TABLE 3: Hypoxia-inducible genes identified from clones derived from both macrophage and myoblast 
libraries. 





                                                   SEQ ID                 Hypoxia/normoxia   Hypoxia/normoxia
GENE NAME                            Accession     protein   nucleotide   (macrophage)       (myoblast)

Solute carrier family 2, member 3    NM_006931     247       248          91.39              8.23
Solute carrier family 2, member 5    NM_003039     311       312          10.75              2.26
Adipophilin                          NM_001122     313       314          13.97              5.10
Hexokinase 2                         NM_000189     249       250          11.50              6.25
Stearoyl-CoA desaturase              AB032261      351       352          3.74               2.31
cDNA DKFZp434O071                    AF125392      75        76           2.31               2.75
Hypoxia-inducible protein 2          NM_013332     271       272          3.62               5.07
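
The two hypoxia/normoxia columns of Table 3 can be compared directly to ask whether a gene's hypoxic induction is stronger in one cell type than the other. The Python sketch below is illustrative only: the function names and the two-fold cut-off are assumptions made for this example and are not part of the method described in the text, and the comma decimal separators that appear in some cells of the source table are read as decimal points.

```python
# Hypoxia/normoxia induction ratios transcribed from Table 3
# as (macrophage, myoblast) pairs.
TABLE_3 = {
    "Solute carrier family 2, member 3": (91.39, 8.23),
    "Solute carrier family 2, member 5": (10.75, 2.26),
    "Adipophilin": (13.97, 5.10),
    "Hexokinase 2": (11.50, 6.25),
    "Stearoyl-CoA desaturase": (3.74, 2.31),
    "cDNA DKFZp434O071": (2.31, 2.75),
    "Hypoxia-inducible protein 2": (3.62, 5.07),
}


def preferential_cell_type(macrophage, myoblast, threshold=2.0):
    """Name the cell type with the stronger hypoxic induction, if any.

    `threshold` is a hypothetical cut-off: one induction ratio must exceed
    the other by at least this factor to count as preferential.
    """
    relative = macrophage / myoblast
    if relative >= threshold:
        return "macrophage"
    if relative <= 1.0 / threshold:
        return "myoblast"
    return "neither"


def classify(table, threshold=2.0):
    """Apply preferential_cell_type to every gene in the table."""
    return {
        gene: preferential_cell_type(mac, myo, threshold)
        for gene, (mac, myo) in table.items()
    }


if __name__ == "__main__":
    for gene, call in classify(TABLE_3).items():
        print(f"{gene}: {call}")
```

With this cut-off the first three genes are called macrophage-preferential, while the remaining four show comparable induction in both cell types. A different threshold would move genes between the categories, so the calls should be read as an illustration of the comparison rather than as results.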



TABLE 4: Hypoxia responses amplified by HIF1alpha overexpression

                               Nucl     Experimental Condition #
Gene Name                      Seq ID   1   2      3      4      5      6      7       8      9

Metallothionein 2A             265      1   0.57   0.69   3.33   3.22   5.77   10.37   2.05   1.70
Metallothionein 1G             244      1   0.68   0.64   4.23   4.21   7.35   11.03   3.65   2.28
Hypothetical protein hqp0376   338      1   0.79   0.61   6.54   4.44   9.01   11.54   4.17   3.22
Novel Metallothionein          84       1   0.95   0.78   5.18   4.36   8.20   11.16   3.48   2.94

Legend: Data shown is the average of 4 repeat experiments. Experimental condition is as shown in the
text. Values represent fold change as compared to untreated cells (condition 1).
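
The legend describes values as fold changes relative to untreated cells (condition 1), averaged over repeat experiments. A minimal Python sketch of one way such values could be derived is given below. It assumes that each repeat is first normalised to its own condition 1 and that the fold changes are then averaged; the text does not state the order of these two steps, and the function name and input layout are hypothetical.

```python
def mean_fold_change(repeats):
    """Average fold change versus condition 1 across repeat experiments.

    `repeats` is a list of experiments, each a list of raw signals ordered
    by experimental condition (condition 1 first). Every experiment is
    divided by its own condition-1 signal, then the per-condition fold
    changes are averaged across experiments.
    """
    if not repeats:
        raise ValueError("at least one repeat experiment is required")
    n_conditions = len(repeats[0])
    fold_changes = []
    for signals in repeats:
        if len(signals) != n_conditions:
            raise ValueError("all repeats must cover the same conditions")
        control = signals[0]
        if control == 0:
            raise ValueError("condition 1 signal must be non-zero")
        fold_changes.append([value / control for value in signals])
    return [
        sum(fc[i] for fc in fold_changes) / len(fold_changes)
        for i in range(n_conditions)
    ]


if __name__ == "__main__":
    # Two made-up repeats with three conditions each, for illustration only.
    print(mean_fold_change([[10.0, 5.0, 20.0], [20.0, 10.0, 60.0]]))
```

Under this scheme condition 1 is always exactly 1, which matches the first value column of the table.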

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TABLE 5: Hypoxia responses amplified by EPAS1 overexpression 



                                 Nucl     Experimental Condition #
Gene Name                        Seq ID   1   2      3      4      5      6      7      8      9

cDNA DKFZp586E1624               66       1   0.77   0.67   1.00   1.12   1.58   0.83   2.60   2.49
Butyrate response factor 1       328      1   0.74   0.64   1.60   1.64   1.57   1.23   2.19   3.20
hypothetical protein FLJ10134    24       1   0.62   0.53   2.73   2.09   2.80   2.87   4.20   3.65
EGL nine (C.elegans) homolog 3   86       1   1.34   0.81   1.98   1.90   2.02   1.94   2.81   3.12
ERO1 (S. cerevisiae)-like        68       1   1.02   1.30   4.26   4.14   4.76   4.12   4.91   6.44
hypothetical protein FLJ10134    24       1   0.68   0.53   2.03   1.97   3.01   2.46   3.67   2.95

Legend: Data shown is the average of 4 repeat experiments. Experimental condition is as shown in the
text. Values represent fold change as compared to untreated cells (condition 1).



TABLE 6: Negative hypoxia responses amplified by HIF1alpha / EPAS1 overexpression

                               Nucl     Experimental Condition #
Gene Name                      Seq ID   1   2      3      4      5      6      7      8      9

Hypothetical protein CGI-117   48       1   0.83   0.87   0.42   0.42   0.32   0.34   0.33   0.27
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CO 
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co 

Accession               Gene name

W72666      345743      Solute carrier family 5, member 3
AA490903    824531      PSCDBP
H05914      43550       lactate dehydrogenase A
H38650      190732      Solute carrier family 2, member 5
AA700054    435036      Adipophilin
AI313387    1928791     Tissue factor
R19956      34778       Vascular endothelial growth factor
AA489261    842863      RTP/NDRG1
AA598840    898328      Early development regulator 2
AA457671    838802      Procollagen-proline 4-hydroxylase alpha 1
N70463      298268      B-cell translocation gene 1
R69163      142139      SH3PX1
AA489752    823691      Cyclin G2
AA446839    783697      BCL2/adenovirus E1B-interacting protein 3
AA063521    359982      BCL2/adenovirus E1B-interacting protein 3
AA700447    460618      NAG-5 protein
AA448157    782760      Cytochrome P450 1B1 (dioxin-inducible)
N75719      244307      Plasminogen activator inhibitor, type I
AA424743    768299      Butyrate response factor 1
AA723035    413633      Butyrate response factor 1
T64469      80484       p8 protein (candidate of metastasis 1)
T62491      79629       Fusin/CXCR4
AI016779    1638893     solute carrier family 16, member 6
N21654      266389      solute carrier family 16, member 6
AA669637    857002      Proline-rich protein with nuclear targeting signal (B4-2)
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TABLE 15 Genes with increased expression by macrophage activation 
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t*leckstrin 


1.14 


0.70 


0.73 


0.54 


6.49 


2.34 


p1C9     373/374     RAB-8b protein                                0.95   0.81   0.77   0.94   5.11   4.53
p1I24    485/486     GRO1                                          0.90   0.72   0.78   1.04   4.69   2.96
p1G3     317/318     B-cell translocation gene 1                   0.70   1.00   0.57   1.14   3.51   3.79
p1B1     243/244     Metallothionein 1G                            0.51   1.00   0.66   1.85   2.50   3.83
p1J11    465/466     Fatty-acid-Coenzyme A ligase, long-chain 2    0.69   0.51   1.36   0.91   3.07   2.97
p1F17    329/330     P8 protein (candidate of metastasis 1)        0.26   1.78   0.16   0.88   1.16   2.59
p1F4     339/340     CYP1                                          0.60   1.04   0.77   1.15   2.52   4.22
p1D10    75/76       insulin induced protein 2                     0.49   1.00   0.48   1.23   1.63   4.96
p1E7     83/84       Novel metallothionein                         0.49   1.26   0.70   1.11   1.32   2.89
p1D24    117/118     EST                                           0.58   0.71   1.00   1.32   1.56   2.59
p1I19    481/482     GRO2                                          0.99   1.00   0.69   0.55   2.65   2.29
p1E22    161/162     cDNA FLJ13618 fis, clone PLACE1010925         1.19   0.77   0.85   0.51   3.09   2.41
p1F6     337/338     hypothetical protein hqp0376                  0.44   1.08   0.47   1.06   1.11   2.47
p1J7     477/478     Sjogren syndrome antigen B                    1.06   0.74   0.85   0.63   2.65   2.95
p1B19    235/236     plasminogen activator inhibitor, type 1       0.59   1.29   0.45   1.17   1.46   3.46
p1F24    297/298     Glia-derived nexin                            0.65   0.68   0.99   1.10   1.62   1.80
p1P5     395/396     SCYA2                                         0.94   0.13   3.81   0.42   2.27   1.00
p1A22    263/264     Adenylate kinase 3                            0.57   1.30   0.58   1.64   1.34   2.74
p1A23    265/266     Metallothionein 2A                            0.55   0.89   0.95   1.01   1.23   4.38
p1A24    239/240     Metallothionein 1H                            0.50   0.84   1.03   1.02   1.08   1.60
p1P3     221/222     PDGFB                                         0.47   2.06   0.31   1.99   1.00   1.49
p1A7     345/346     SLC31A2                                       1.10   0.92   0.84   0.93   2.30   2.32
p1P14    91/92/92a   Semaphorin 4b                                 0.47   2.52   0.95   4.04   3.96   3.88



Legend

mRNA expression values in the 6 experimental conditions (#1 no cytokines/normoxia, #2 no cytokines/
hypoxia, #3 IL-10/normoxia, #4 IL-10/hypoxia, #5 LPS/IFN/normoxia, #6 LPS/IFN/hypoxia) are shown
as values referenced to the median value of that gene throughout all 6 experimental conditions.
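
The legend states that each value is referenced to the median of that gene across the six experimental conditions. The Python sketch below shows one plain reading of that normalisation, dividing each condition's signal by the median of the six signals. This is an assumption for illustration: the exact procedure used to produce the table (for example whether replicate measurements entered the median) is not given in the text, and the function name is hypothetical.

```python
from statistics import median


def reference_to_median(signals):
    """Divide each condition's signal by the median across all conditions.

    `signals` holds one gene's expression values, one per experimental
    condition. A result above 1 means the gene is expressed above its own
    median in that condition; below 1 means it is expressed below it.
    """
    centre = median(signals)
    if centre == 0:
        raise ValueError("median signal must be non-zero")
    return [value / centre for value in signals]


if __name__ == "__main__":
    # Made-up raw signals for conditions #1 to #6, for illustration only.
    print(reference_to_median([2.0, 4.0, 4.0, 4.0, 8.0, 16.0]))
```

Because each gene is scaled by its own median, values are comparable across conditions for one gene but say nothing about the absolute expression level of one gene relative to another.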



TABLE 16. Genes down-regulated by macrophage activation 









mRNA EXPRESSION 








(experimental condition) 


Clone 


Sea ID 


Gene Name 


#1 


#2 


#3 


#4 


#5 


#6 




















plH13 


193/194 


EST 


1.35 


1.41 


1.00 


0.91 


0.44 


0.68 


plE4 


125/126 


EST 


1.22 


1.13 


1.09 


0.96 


0.40 


0.40 


plG7 


281/282 


EST 


1.30 


1.44 


1.01 


1.51 


0.54 


0.82 


plEl 


123/124 


EST 


1.21 


1.64 


0.94 


1.35 


0.51 


0.63 


plDl 8 


127/128 


cDNA FLJ13443 fis, clone 
PLACE1002853 


1.61 


2.60 


0.57 


1.33 


0.26 


0.24 


p 1 12 


149/150 


cDNA FLJ11302 fis, clone 
PLACE1009971 


2.39 


1.23 


1.07 


0.54 


0.45 


0.43 


plG20 


203/204 


cDNA YO23H03 


1.45 


0.73 


1.60 


1.12 


0,57 


p.44 


plD21 


129/130 


Hypothetical protein 
FLJ22622 


1.41 


1.72 


0.82 


1.26 


0.14 


0.14 


plF8 


9/10 


Hypothetical protein 
KIAA0914 


1.34 


4.14 


0.77 


2.74 


0.13 


0.25 


plD16 


33/34 


Hypothetical protein 
FLJ20308 


1.31 


2.36 


1.00 


1.59 


0.29 


0.67 


plF3 


333/334 


Hypothetical protein 
XP_017131 


1.63 


2.21 


0.92 


1.00 


0.42 


0.42 


plD 12 


29/30 


Hypothetical protein 
KIAA1376 


0.89 


2.62 


0.79 


2.07 


0.28 


2.61 


plI4 


53/54 


Hypothetical protein 
HSPC196 


1.95 


1.06 


1.20 


0.57 


0.63 


0.28 


plD9 


27/28 


Hypothetical protein 
DKFZP564D116 


1.63 


0.94 


1.25 


0.96 


0.55 


0.85 


plF9 


19/20 


Hypothetical protein 
KIAA0742 


0.94 • 


3.54 


0.60 


1.74 


0.33 


1.74 


plFll 


323/324 


Hypothetical protein 
LOC51754 


1.67 


1.91 


1.00 


0.86 


0.60 


0.59 


p 1 II 5 


47/48 


Hypothetical protein CGI- 
117 


1.31 


0.62 


1.86 


1.26 


0.49 


0.76 


plEl 3 


21/22 


Hypothetical protein 
PRO0823 


1.15 


0.93 


1.15 


1.08 


0.44 


0.24 
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plFlO 


"5/6" 


Hypothetical protein 
DKFZp434P0U6 


2.16 


1.05 


1.54 


0.83 


0.83 


0.67 


plDl 


23/24 


Hypothetical protein 
FLJ10134 


0.86 


1.70 


0.61 


2.42 


0.35 


1.61 


plIS 


41/42 


Hypothetical protein 
FLJ10815 


1.49 


1.00 


1.30 


0.83 


0.43 


0.37 


p1G13 | 293/294 | ABCA1 | 1 | 1.05 | 1.22 | 1.07 | 0.43 | 0.46
p1B9 | 313/314 | adipophilin | 1.01 | 3.74 | 1.32 | 2.08 | 0.02 | 0.2
p1B7 | 313/314 | adipophilin | 1.57 | 3.44 | 0.77 | 1.17 | 0.21 | 0.45
p1B6 | 313/314 | adipophilin | 1.24 | 2.45 | 0.8 | 1.51 | 0.38 | 0.52
p1B8 | 313/314 | adipophilin | 1.11 | 1.87 | 1.01 | 1.04 | 0.55 | 0.59
p1K7 | 411/412 | ATP-binding cassette E1 | 1.34 | 0.74 | 1.58 | 1.05 | 0.62 | 0.48
p1J23 | 447/448 | Calgranulin A | 1.21 | 0.94 | 3.35 | 2.8 | 0.58 | 0.86
p1K18 | 415/416 | Colony-stimulating factor 1 | 2.01 | 0.84 | 1.7 | 0.88 | 1 | 0.74
p1C2 | 331/332 | CXCR4 | 1.01 | 3.76 | 0.46 | 1.47 | 0.08 | 1.13
p1C1 | 331/332 | CXCR4 | 1.05 | 3.64 | 0.39 | 1.63 | 0.27 | 0.98
p1G12 | 321/322 | Cyclin G2 | 0.85 | 2.17 | 0.6 | 1.29 | 0.28 | 1.33
p1F16 | 325/326 | CYP1B1 | 1.37 | 0.96 | 1.67 | 0.66 | 0.64 | 0.48
p1C7 | 369/370 | D123 | 1.69 | 1.33 | 1.1 | 0.7 | 0.65 | 0.83
p1G17 | 315/316 | Early development regulator 2 | 0.97 | 2.47 | 1.12 | 2.24 | 0.29 | 0.85
p1I23 | 475/476 | Ecotropic viral integration site 2A | 1.39 | 1.25 | 1.11 | 1.72 | 0.18 | 0.22
p1A14 | 257/258 | Enolase 1 | 0.91 | 3.22 | 1.19 | 2.46 | 0.13 | 0.37
p1A10 | 273/274 | Enolase 2 | 1.17 | 5.28 | 0.59 | 3.77 | 0.49 | 1.08
p1D6 | 67/68 | ERO1 (S. cerevisiae)-like | 0.84 | 3.02 | 0.97 | 2.87 | 0.32 | 1.58
p1A11 | 253/254 | GAPDH | 1.21 | 2.41 | 0.93 | 1.31 | 0.32 | 0.81
p1A12 | 253/254 | GAPDH | 1.09 | 1.97 | 1 | 1.49 | 0.41 | 0.96
p1K22 | 419/420 | GPR44 | 1.24 | 1.03 | 1.42 | 0.93 | 0.56 | 0.48
p1C18 | 269/270 | Granulin | 1.28 | 1.59 | 0.96 | 1.03 | 0.56 | 0.72
p1C17 | 269/270 | Granulin | 1.58 | 1.6 | 0.62 | 0.41 | 0.76 | 0.94
p1A15 | 249/250 | Hexokinase-2 | 0.89 | 3.88 | 0.68 | 3.11 | 0.38 | 2.02
p1C13 | 381/382 | Jk-recombination signal binding protein | 1.11 | 1.18 | 1.43 | 1.98 | 0.32 | 0.73
p1A8 | 223/224 | Lactate dehydrogenase A | 0.7 | 2.25 | 1.4 | 1.44 | 0.26 | 1.32
p1A9 | 223/224 | Lactate dehydrogenase A | 0.77 | 1.85 | 1.15 | 1.68 | 0.32 | 1.19
p1G5 | 279/280 | MAX-interacting protein 1 | 1.24 | 5.5 | 0.9 | 4.48 | 0.34 | 0.97
p1D22 | 119/120 | MAX-interacting protein 1 | 1.2 | 3.86 | 0.52 | 3.44 | 0.37 | 0.91
p1G18 | 211/212 | Mitochondrion sequence | 1.27 | 1.12 | 1 | 1.31 | 0.57 | 0.77
p1K23 | 403/404 | MYC | 1.37 | 0.77 | 2.39 | 1.09 | 0.54 | 0.35
p1E20 | 99/100 | Myo-inositol monophosphatase A3 | 1.12 | 1.28 | 1.02 | 0.99 | 0.48 | 0.61
p1B20 | 267/268 | Osteopontin | 1.13 | 1.58 | 0.99 | 1.52 | 0.1 | 0.4
p1F13 | 343/344 | Papillomavirus regulatory factor PRF-1 | 0.98 | 5.02 | 0.44 | 6.79 | 0.09 | 2.43
p1A13 | 255/256 | Phosphoglycerate kinase 1 | 1.04 | 2.45 | 1.23 | 1.83 | 0.2 | 0.9
p1G9 | 305/306 | PI-3-kinase, catalytic, beta polypeptide | 1.46 | 1.88 | 0.75 | 1.17 | 0.44 | 0.47
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p1E18 | 63/64 | Plexin C1 | 1.72 | 1.79 | 1 | 0.85 | 0.69 | 0.35
p1C11 | 377/378 | Polyubiquitin | 1.13 | 1.79 | 0.79 | 1.14 | 0.5 | 0.84
p1B3 | 231/232 | Proline 4-hydroxylase, alpha polypeptide I | 0.94 | 1.38 | 1.03 | 1.58 | 0.43 | 0.89
p1B4 | 349/350 | Proline 4-hydroxylase, alpha polypeptide II | 0.9 | 1.46 | 1.05 | 1.41 | 0.44 | 1
p1B22 | 355/356 | Protease, serine, 11 | 1.3 | 1.1 | 1.26 | 0.92 | 0.64 | 0.7
p1C10 | 375/376 | Regulator of G-protein signalling 1 | 1.42 | 1.68 | 0.94 | 1.55 | 0.47 | 0.95
p1D3 | 95/96 | Serine carboxypeptidase 1 | 1.22 | 1.07 | 1.07 | 1.09 | 0.33 | 0.88
p1F15 | 341/342 | SHB adaptor protein | 1.04 | 1.61 | 0.94 | 1.72 | 0.43 | 0.54
p1A5 | 311/312 | SLC2A5 | 0.71 | 2.6 | 1.06 | 2.09 | 0.34 | 1.09
p1G4 | 307/308 | SLC5A3 | 1.12 | 1.44 | 0.93 | 1.31 | 0.33 | 0.62
p1A20 | 261/262 | Triosephosphate isomerase 1 | 0.97 | 2.06 | 1.09 | 2.24 | 0.17 | 0.66
p1D15 | 31/32 | TRIP-Br2 | 1.16 | 1.4 | 1.1 | 1.25 | 0.47 | 0.46
p1K4 | 443/444 | TSC-22 | 1.44 | 1 | 1.55 | 0.7 | 0.6 | 0.57

TABLE 17: Genes responsive to IL-10 (increased or decreased) but not affected significantly by LPS+IFN 









mRNA EXPRESSION (experimental condition)
Clone | Seq ID | Gene Name | #1 | #2 | #3 | #4 | #5 | #6
p1H8 | 181/182 | ABL | 1.02 | 0.96 | 6.65 | 5.25 | 0.86 | 0.73
p1E15 | 107/108 | cDNA YI27F12 | 0.48 | 0.77 | 1.69 | 2.45 | 0.78 | 1.44
p2A14 | 445/446 | EST | 1.06 | 0.74 | 2.78 | 3.09 | 1.06 | 0.82
p1H6 | 191/192 | EST | 1.01 | 0.84 | 2.47 | 2.05 | 0.93 | 0.83
p1E5 | 141/142 | Hepcidin antimicrobial peptide | 0.84 | 0.73 | 1.91 | 1.68 | 0.58 | 2.16
p1I12 | 151/152 | Hypothetical protein MGC4549 | 1.07 | 0.67 | 2.34 | 2.53 | 1.11 | 0.74
p1D8 | 271/272 | Hypoxia-inducible protein 2 | 0.65 | 1.00 | 1.51 | 1.89 | 0.71 | 2.05
p1K14 | 421/422 | Keratin 6B | 1.03 | 0.68 | 3.80 | 3.28 | 0.97 | 0.76
p1J22 | 427/428 | Neutral sphingomyelinase (N-SMase) activation associated factor | 0.94 | 0.79 | 5.59 | 3.52 | 0.91 | 1.29
p1G15 | 287/288 | Phosphoglucomutase 1 | 0.82 | 1.20 | 1.83 | 1.90 | 0.61 | 1.05
p1A2 | 247/248 | SLC2A3 | 0.37 | 3.31 | 1.00 | 3.32 | 0.49 | 2.63
p1A3 | 247/248 | SLC2A3 | 0.39 | 2.45 | 1.00 | 2.65 | 0.20 | 1.50
p1K2 | 433/434 | CFFM4 | 1.30 | 0.98 | 0.51 | 0.59 | 1.11 | 0.91
p1C4 | 363/364 | FGF receptor activating protein 1 | 1.02 | 0.96 | 0.50 | 0.63 | 1.16 | 1.31
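
By way of illustration only, the expression ratios tabulated for each clone can be screened programmatically. The following Python sketch takes two rows of Table 17 (gene name and the six ratios for conditions #1 to #6) and reports the conditions under which each gene appears increased or decreased. The cut-off values of 2.0 and 0.5, and the names `UP`, `DOWN`, `classify` and `changed_conditions`, are assumptions made for this sketch and are not taken from the specification.

```python
from typing import Dict, List

# Assumed cut-offs (not stated in the source text): a ratio of at least 2.0
# is treated as "up" and a ratio of at most 0.5 as "down".
UP = 2.0
DOWN = 0.5

# Two rows copied from Table 17: gene name -> ratios for conditions #1..#6.
rows: Dict[str, List[float]] = {
    "ABL": [1.02, 0.96, 6.65, 5.25, 0.86, 0.73],
    "FGF receptor activating protein 1": [1.02, 0.96, 0.50, 0.63, 1.16, 1.31],
}


def classify(ratio: float) -> str:
    """Label a single expression ratio against the assumed cut-offs."""
    if ratio >= UP:
        return "up"
    if ratio <= DOWN:
        return "down"
    return "unchanged"


def changed_conditions(values: List[float]) -> Dict[str, str]:
    """Return only the conditions whose ratio crosses a cut-off."""
    result = {}
    for index, value in enumerate(values, start=1):
        label = classify(value)
        if label != "unchanged":
            result[f"#{index}"] = label
    return result


for gene, values in rows.items():
    print(gene, changed_conditions(values))
```

With these assumed cut-offs, ABL is flagged under conditions #3 and #4 only, and a different choice of cut-off would alter which genes are reported.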
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TABLE 20: Genes up-regulated in response to TNFα
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6.03 


p1A20


Triosephosphate isomerase 1


262 


1 


16.1 
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TABLE 21: Genes down-regulated in response to TNFα









Cytokine / % Oxygen 








none 


none 


TNF 


TNF 


Clone 


Gene Name 


SeqID 


20 


0.1 


po 
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1.50 
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0.70 
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TABLE 22: Genes up-regulated in response to IL-17 









Cytokine / % Oxygen 
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Gene Name 


SeqID 


20 
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1.50 


9.34 


7.13 
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396 




0.24 


2.12 


0.58 
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408 




0.44 


2.48 


0.83 



WO 02/46465 PCT/GB01/05458 




TABLE 23: Genes down-regulated in response to IL-17 









Cytokine / % Oxygen 








none 


none 
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Clone 


Gene Name 


SeqID 


20 
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20 
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182 
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1.08 


0.08 
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SMase) activation associated 
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TABLE 24: Genes up-regulated in response to IL-15 









Cytokine / % Oxygen 
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Clone 


Gene Name 


SeqID 


20 
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20 
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Aldolase C 
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1.39 


p1J11


Fatty-acid-Coenzyme A 
ligase, long-chain 2 


466 
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0.55 


1.64 


1.29 


p1A11


GAPDH 


254 




0.78 


0.52 


4.32 


p1C6


Glucose phosphate isomerase 


368 




0.84 


0.57 


3.13 


p1H5


Hypothetical protein 
FLJ22690 


206 




0.94 


2.37 


1.59 


p1A23


Metallothionein 2A 
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3.08 
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396 
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3.27 


1.61 


p1I21


SCYA8 
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0.37 


3.77 


1.55 


p1I7


Uridine 5' monophosphate 
hydrolase 1 


50 
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0.84 


4.98 


3.61 
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TABLE 25: Genes down-regulated in response to IL-15 









Cytokine / % Oxygen 
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Clone 


Gene Name 


SeqID


20 
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20 
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p1H8


ABL 
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1.08 


1.22 


0.09 


p1C14


Abstract 


384 
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Adipophilin


314 
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p1C7


D123 
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0.47 
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p1H6


EST 


192 




1.46 


1.95 


0.67 


p2A14


EST 


446 




1.10


1.08 


0.51 


p1G24


Glycogen synthase 1 


276 




1.45 


0.88 


0.55 


p1J6


hypothetical protein 
FLJ10206 


40 




1.28 


1.28 


0.18 


p1I12


Hypothetical protein 
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1.20 


1.64 


0.28 


p1D8


Hypoxia-inducible protein 2 
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0.62 


p1K14


Keratin 6B 


422 




1.27 


1.48 


0.11
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Lactate dehydrogenase A 


224 




1.31 


0.48 


2.29 
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Lactate dehydrogenase A 


224 
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Neutral sphingomyelinase (N- 
SMase) activation associated 
factor 
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0.11 
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Proline 4-hydroxylase, alpha 
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Sequence listing 

1 

MLTLPQNFGNIFLGETFSSYISVHNDSNQWKDILVKADLQTSSQRLNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCA 
5 VSYTTQAGEKMYFRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEWLEAQIQmTTSPMFMEKVSLEPSIMYWTELNSVSQAG 
ECVSTFGSRAYLQPMDTRQYLYCLKPKNEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPD 
TWLEEPFHITCKITNCSERTMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLCLALTLLSSVQGLQSISGLRLTDTFLKRTYE 
YDDIAQVCWSSAIKVES 

2 

10 AAAAAGTGCCGGTCAAAATGGAAGTGAATCCCCCTAAACAGGAGCACCTGCTGGCGCTAAAAGTGATGCGGCTGACTAAGCCTA 
CTTTATTCACCAATATCCCAGTAACATGTGAAGAGAAAGACTTACCTGGAGATCTCTTTAACCAGCTGATGAGAGATGATCCTT 
CAACCGTTAATGGTGCAGAAGTTTTAATGTTGGGATAAATGCTGACTTTACCACAGAATTTTGGGAATATATTTTTGGGAGAGA 
CCTTTTCCAGTTATATCAGCGTTCATAATGATAGCAATCAAGTTGTAAAAGACATATTAGTAAAAGCTGATCTTCAGACAAGTT 
CTCAGCGTTTAAATCTTTCAGCCTCCAATGCTGCAGTGGCTGAACTTAAACCGGATTGTTGTATTGATGATGTCATACATCATG 

15 AAGTCAAAGAAATTGGAACACACATCTTGGTATGTGCTGTGAGTTATACAACTCAGGCTGGAGAAAAAATGTATTTCAGAAAAT 
TCTTCAAATTTCAGGTTCTCAAACCATTGGATGTGAAAACCAAATTTTACAATGCAGAGAGTGACCTCAGTTCTGTGACTGATG 
AAGTATTTCTGGAAGCCCAGATTCAGAATATGACAACCTCACCTATGTTTATGGAGAAGGTTTCACTGGAGCCATCTATTATGT 
ACAATGTAACAGAATTAAATTCAGTCAGCCAAGCTGGAGAATGTGTGTCTACGTTTGGGTCAAGAGCATATTTGCAACCAATGG 
ATACACGCCAGTACTTATACTGCCTAAAGCCAAAGAATGAATTTGCAGAAAAAGCAGGCATCATTAAGGGAGTAACAGTAATTG 

20 GAAAATTGGATATAGTATGGAAAACAAATCTAGGTGAAAGGGGAAGGTTACAGACCAGCCAACTTCAAAGAATGGCTCCAGGTT 
ATGGAGATGTTAGGTTGTCTTTGGAGGCAATACCAGATACCGTAAACCTTGAAGAACCTTTTCATATTACCTGTAAAATAACAA 
ACTGCAGTGAAAGGACTATGGATCTGGTTTTGGAAATGTGCAATACCAATTCCATCCACTGGTGTGGAATTTCAGGAAGACAGC 
TGGGAAAGCTGCATCCAAGTTCTTCGCTCTGTCTTGCCCTTACTCTGCTTTCTTCAGTACAGGGACTGCAAAGCATCTCTGGCT 
TAAGACTAACAGACACATTCTTAAAGAGAACATATGAATATGATGACATCGCACAAGTCTGTGTGGTATCTTCTGCCATTAAAG 

25 TGGAAAGCTGAAGGAAACTTCCAATGTTAGGCTTTTCATTTAGTTTCACAGAACTGCTCTTTTTGTTACCTTTGTAAAATGATG 
ACGTCAACAACGAAGTAAATATGTTACTTAAATTTTCTTTCCTGTAAAATGGTTAAAATATTAAATATTTGTTTTGAAGAGATC 
TGATTTTATCTTGTAATTTATATTTGAAATGAACATGTGTATATTTTCTACACCTATTATTTAATTTCATTTCATTTTAGATGA 
CCATTGGACTTTGTTCTCCAAAAGCTGTGTATCTGAGACCATTTGTCCCTAGCAAGTTATCTAGAACACGAGTCAGCACACTTT 
TTATGTAAGGTACAAGATACTAAATATTTTAGGCTTTGCAGGCCGTAAGGTTTCTGTCACAAATACTGAACTCTGCCATCGTAC 

30 TTCAAGAGCAGCCAGTAGGCAAAAAAAGTACAGTGGTGTACACAAAAAGAAGCAGTTGGACTCGGGAAGCTGAGGCAGGAGAAT 
CACTTGAACCGGGGAGGCAGAGGTTGCAGTTAAGCCGAGATCATGCTACTGCACTCTAGCCTGAGCGACAGAGTGAGACTCTGT 
CTCAAAAAAAAAAGAGGCAGCTGGGCAGATTTGGCCTGCAGGGTATAGTTGGCTGACCTCAATCTAGAAGATAGAGTTGTTTCT 
TAGAACTTTGTAATATTTTGGTTAGAAATATTTCAGTTAACTCCAGTTTTTTCCTAGCTATTCGGTCAGCAGAAAAAGTTTGCT 
TAGAACAAAGGGGGTAGTCCTCTTATAACCTGATATTTGTCAACACAGTATAAGCTTTCTAGTTGTTTTCAAAATTACTGTAAT 

35 GGTTTGTGTCTGTATTTGTATATTATACCTACAAACATACATATCTTTCAGCTACTGAACACATCAGAATTGGTTTTACTTTTT 
AGGTTGTTGATTTTTTTTAATTAACTATTAACTAAGAATAATACTTATTTTCTAATTATAAGCCAAAAACCCCTGGGTTTTTTT 
CTCATTGATTAAGCCACTTATTTTTATAAGGTATATTTAAATAATTCCCATCTTCTGATATGATTTTAAACTCTAAATCATGTG 
TTACTGTTACAAAACTTTCTCTCTCCATGTAATCACACTTAGTTATGAGCAAAGCAGTGAGAAAGTTGAGGTGATAAACAACTT 
TGCAGCATTAATGTAACAAAAAGTTTCAGTATTTGTCAGGAAAACAAAAGCTTAGGCTGTTAAATAAAAGCTATTCTATTTTGG 

40 TAAGTTGAAATTCAAGAATGTATATATAATTTCTTAAATGTAAGAGTGCATTAAATCAAATACAAAAATTGGTTTTTCAATGCA 
TCATTATTTATTGCCATAACAAATTGTTTGGTCCAAAATGAAAGCTATATTCAGAAGGTTTGCACCTCAAAATTGAGTAATAAT 
TAATGAAATGTCTGTACAAAATAAAGTGCAAGCAGTGT 

3 

MQMRRVWRTCRSGPRKRREARYKKSTEDIFPAQLLKLQRHERWQQEPPVRDHRSWGGSGAGGVAGREWTDQGQVALGGHYMAE 
45 GEGYFAMSEDELACSPYIPLGGDFGGGDFGGGDFGGGDFGGGGSFGGHCLDYCESPTAHCWLNWEQVQRLDGILSETIPIHGR 
GNFPTLELQPSLIVKWRRRLAEKRIGVRDVRLNGSAASHVLHQDSGLGYKDLDLIFCADLRGEGEFQTVKDWLDCLLDFLPE 
GWKEKITPLTLKEAWQKMVKVC^ 

HPTIIGESVYGDFQEAFDHLCNKIIATRNPEEIRGGGLLKYCNLLVRGFRPASDEIKTLQRYMCSRFFIDFSDIGEQQRKLESY 
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LQNHFVGLEDRKYE YLMTLHGWNE S TVCLMGHERRQTLNL I TMLAI RVLADQNVI PNVAWTC YYQPAPYVADANF SNYYI AQ 
VQPVFTCQQQTYSTWLPCN 

4 

ACAACTGCTAAAGCTCCAGAGACACGAGCGTGTGTGGCAGCAAGAGCCGCCAGTTCGGGACCACCGCAGCTGGGGTGGCAGCGG 
5 CGCAGGAGGGGTCGCGGGGCGGGAGTGGTGAGCGCAGGCGGCAGGGGTCTGGGAAAGACGMGTCGCTATTTGCTGTCTGAGCG 
CGCTCGCAGCTCCTGGAAGTGTTGCCGCCTCTCGGTTTCGCTCTCGCTCGCTGCGCTCCTAGAAGGGGCGGCCGCCTCCAGGAC 
TGACCAGGGCCAAGTGGCGCTCGGCGGGCACTACATGGCGGAGGGTGAAGGGTACTTCGCCATGTCTGAGGACGAGCTGGCCTG 
CAGCCCCTACATCCCCCTAGGCGGCGACTTCGGCGGCGGCGACTTCGGCGGCGGCGACTTCGGCGGCGGCGACTTCGGCGGCGG 
CGACTTCGGCGGTGGCGGCAGCTTCGGTGGGCATTC^ 

10 GGAGCAAGTGCAGCGGCTGGACGGCATCCTGAGTGAGACCATTCCGATTCACGGGCGCGGCAACTTCCCCACGCTCGAGCTGCA 
GCCGAGCCTGATCGTGAAGGTGGTGCGGCGGCGCCTGGCCGAGAAGCGCATTGGCGTCCGCGACGTGCGCCTCAACGGCTCGGC 
AGCCAGCCATGTCCTGCACCAGGACAGCGGCCTGGGCTACAAGGACCTGGACCTCATCTTCTGCGCCGACCTGCGCGGGGAAGG 
GGAGTTTCAGACTGTGAAGGACGTCGTGCTGGACTGCCTGTTGGACTTCTTACCCGAGGGGGTGAACAAAGAGAAGATCACACC 
ACTCACGCTCAAGGAAGCTTATGTGCAGAAAATGGTTAAAGTGTGCAATGACTCTGACCGATGGAGTCTTATATCCCTGTCAAA 

15 CAACAGTGGCAAAAATGTGGAACTGAAATTTGTGGATTCCCTCCGGAGGCAGTTTGAATTCAGTGTAGATTCTTTTCAAATCAA 
ATTAGACTCTCTTCTGCTCTTTTATGAATGTTCAGAGAACCCAATGACTGAGACATTTCACCCCACAATMTCGGGGAGAGCGT 
CTATGGCGATTTCCAGGAAGCCTTTGATCACCTTTGTAACAAGATCATTGCCACCAGGAACCCAGAGGAAATCCGAGGGGGAGG 
CCTGCTTAAGTACTGCAACCTCTTGGTGAGGGGCTTTAGGCCCGCCTCTGATGAAATCAAGGCCCTTCAAAGGTACATGTGTTC 
CAGGTTTTTCATCGACTTCTCAGACATTGGAGAGCAGCAGAGAAAACTGGAGTCCTATTTGCAGAACCACTTTGTGGGATTGGA 

20 AGACCGCAAGTATGAGTATCTCATGACCCTTCATGGAGTGGTAAATGAGAGCACAGTGTGCCTGATGGGACATGAAAGAAGACA 
GACTTTAAACCTTATCACCATGCTGGCTATCCGGGTGTTAGCTGACCAAAATGTCATTCCTAATGTGGCTAATGTCACTTGCTA 
TTACCAGCCAGCCCCCTATGTAGCAGATGCCAACTTTAGCAATTACTACATTGCACAGGTTCAGCCAGTATTCACGTGCCAGCA 
ACAGACCTACTCCACTTGGCTACCCTGCAATTAAGAATCATTTAAAAATGTCCTGTGGGGAAGCCATTTCAGACAAGACAGGAG 
AGAAAAAAAAAAAAAAAAAAAA 

5

MKKFSRMPKSEGGSGGGAAGGGAGGAGAGAGCGSGGSSVGVRVFAVGRHQVTLEESLAEGGFSTVFLVRTHGGIRCALKRMYVN 

MPDLIWCKREITIMKELSGHKNIVGYLDCAW 

LHQCKTPIIHRDLKVENILLNDGGNWLCDFGSA^^ 

GCLLYKLCFFTLPFGESQVAICDGNFTIPDNSRYSRNIHCLIRFMLEPDPEHRPDIFQVSYFAFKFAKKDCPVSNINKCCKQLL 
30 RHGALLTEILLFLQLFLNRMTASEAAARKSQIKARITDTIGPTETSIAPRQRPKANSATTATPSVLTIQSSATPVKVLAPGEFG 
NHRPKGALRPGNGPEILLGQGPPQQPPQQHRVLQQLQQGDWRLQQLHLQHRHPHQQQQQQQQQQQHHHHHHHHLLQDAYMQQYQ 
HATQQQQMLQQQFLMHSWQPQPSASQYPTMPQYQQAFFQQQMLAQHQPSQQQASPEYLTSPQEFSPALVSYTSSLPAQVGTI 
MDSSYSANRSVADKEAIANFTNQKNISNPPDMSGTOPFGEDNFSKLTEEELLDREFDLLRSSKGHLKAYFASQ 

6 

35 AGCCGGGCAGCTGCAGCGGAGCCGCGGAGCGGGCGGCGGGGCCCAGGCTGTGCGCTTGGGGAGCGCGGAATGTGAGGCTTGGCG 
GGCCGCAGCACGCTCGGACGGGCCAGGGGCGGCGACCCCTCGCGGACGCCCGGCTGCGCGCCGGGCCGGGGACTTGCCCTTGCA 
CGCTCCCTGCGCCCTCCAGCTCGCCGGCGGGACCATGAAGAAGTTCTCTCGGATGCCCAAGTCGGAGGGCGGCAGCGGCGGCGG 
AGCGGCGGGTGGCGGGGCTGGCGGGGCCGGGGCCGGGGCCGGCTGCGGCTCCGGCGGCTCGTCCGTGGGGGTCCGGGTGTTCGC 
GGTCGGCCGCCACCAGGTCACCCTGGAAGAGTCGCTGGCCGAAGGTGGATTCTCCACAGTTTTCCTCGTGCGTACTCACGGTGG 

40 AATCCGATGTGCATTGAAGCGAATGTATGTCAATMCATGCCAGACCTCAATGTTTGTAAAAGGGAAATTACAATTATGAAAGA 
GCTATCTGGTCACAAAAATATTGTGGGCTATTTGGACTGTGCTGTTAATTCAATTAGTGATAATGTATGGGAAGTCCTTATCTT 
MTGGAATATTGTCGAGCTGGACAGGTAGTGAATCAAATGAATAAGAAGCTACAGACGGGTTTTACAGAACCAGAAGTGTTACA 
GATATTCTGTGATACCTGTGAAGCTGTTGCAAGGTTGCATCAGTGTAAGACTCCAATAATTCACCGGGATCTGAAGGTAGAAAA 
TATTTTGTTGAATGATGGTGGGAACTATGTACTTTGTGACTTTGGCAGTGCCACTAATAAATTTCTTAATCCTCAAAAAGATGG 

45 AGTTAATGTAGTAGAAGAAGAAATTAAAAAGTATACAACTCTGTCATACAGAGCCCCTGAAATGATCAACCTTTATGGAGGGAA 
ACCCATCACCACCAAGGCTGATATCTGGGCACTGGGATGTCTACTCTATAAACTTTGTTTCTTCACTCTTCCTTTTGGTGAGAG 
TCAGGTTGCTATCTGTGATGGCAACTTCACCATCCCAGACAATTCTCGTTACTCCCGTAACATACATTGCTTAATAAGGTTCAT 
GCTTGAACCAGATCCGGAACATAGACCTGATATATTTCAAGTGTCATATTTTGCATTTAAATTTGCCAAAAAGGATTGTCCAGT 
CTCCAACATCAATAAGTGTTGTAAACAATTACTGAGACACGGAGCACTGTTAACTGAAATTCTTCTATTCCTTCAGCTCTTCCT 




GAACCGATGACTGCTAGTGAAGCAGCTGCTAGGAAAAGCCAAATAAAAGCCAGAATAACAGATACCATTGGACCAACAGAAACC 
TCAATTGCACCAAGACAAAGACCAAAGGCCAACTCTGCTACTACTGCCACTCCCAGTGTGCTGACCATTCAAAGTTCAGCAACA 
CCTGTTAAAGTCCTTGCTCCTGGTGAATTCGGTAACCATAGACCAAAAGGGGCACTAAGACCTGGAAATGGCCCTGAAATTTTA 
TTGGGTCAGGGACCTCCTCAGCAGCCGCCACAGCAGCATAGAGTACTCCAGCAACTACAGCAGGGAGATTGGAGATTACAGCAA 
5 CTCCATTTACAGCATCGTCATCCTCACCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCACCACCACCACCACCACCACCAC 
CTACTTCAAGATGCTTATATGCAGCAGTATCAACATGCAACACAGCAGCAACAGATGCTTCAACAACAATTTTTAATGCATTCG 
GTATATCAACCACAACCTTCTGCATCACAGTATCCTACAATGATGCCGCAGTATCAGCAGGCTTTCTTTCAACAGCAGATGCTA 
GCTCAACATCAGCCGTCTCAACAACAGGCATCACCTGAATATCTTACCTCCCCTCAAGAGTTCTCACCAGCCTTAGTTTCCTAC 
ACTTCATCACTTCCAGCTCAGGTTGGAACCATAATGGACTCCTCCTATAGTGCCAATAGGTCAGTTGCTGATAAAGAGGCCATT 

10 GCAAATTTCACAAATCAGAAGAACATCAGCAATCCACCTGATATGTCAGGGTGGAATCCTTTTGGAGAGGATAATTTCTCTAAG 
TTAACAGAAGAGGAACTATTGGACAGAGAATTTGACCTTCTAAGATCAAGTAAGGGACACTTGAAGGCTTATTTTGCTTCACAG 
TAAAATAACAGCTCTATTATTATTCAGCAAGGCCAAAGACTTTTGAGAATGTGTATGGAAAATTCTTTTTGTGCATTTGAGGGC 
AAAATTCAGGCCATCTTCTTATACATATACTATCAAATTATGTTGTGTGCATTAGAAATCAGTTGCTTGATAGTAGCTATTAAA 
CCCAATATTGCTGATAGTATGCTAATATCCTAAAACTTAAATATTGCATATCTATGAATGTTAAATTCAGAATATCTCTAAAAC 

15 ATGGAAAATTGATGTTTCAATAAAAAGGGAGACATTTTATTATTTTGCCTTACTAATGATTTTGCAGCTCTGTTTTTCTGCACA 
CTCAATAGAAAATATTGTGGGTCTGAGATGCCCTTTGAAAATGCCTGAAAGAAAACATGGGCTACTACATTATGTTAATGTTTT 
GTAATGTCCTTTTAATGAGTGGTGACATAAAAGGGCTGCTTTGTTGTCCTGATATGGCAAAAATTAAATAAATTCTTTTTCACT 
TTAAAAAAAMAAAAAMAMAAAAAA , 

7 

20 MQWRALVLGLVLLRLGLHGVLWLVFGLGPSMGFYQRFPLSFGFQRLRSPDGPASPTSGPVGRPGGVSGPSWLQPPGTGAAQSPR 
KAPRRPGPGMCGPANWGYVLGGRGRGPDEYEKRYSGAFPPQL^ 

DPSNLNINDVLGNYSLTLVDALDTLAIMGNSSEFQKAVKLVINTVSFDKDSTVQVFEATIRVLGSLLSAHRIITDSKQPFGDMT 
IKDYDNELLYMAHDLAVRLLPAFENTKTGIPYPRWM 

MLRSNDTGLLGNVWIQTGHWVGKQSGLGAGLDSFYEYLLKSYILFGEKEDLEMFNAAYQSIQNYLRRGREACNEGEGDPPLY 
25 WMFSGQLMNTWIDSLQAFFPGLQVLIGDVEDAICLHAFYYAIMRY 

TKNPFYLHVGMDILQSLEKYTKVKCGYATLHHVIDKSTEDRMESFFLSETCKYLYLLFDEDNPVHKSGTRYMFTTEGHIVSVDE 
HLRELPWKEFFSEEGGQDQGGKSVHRPKPHELKVINSSSNCNRVPDERRYSLPLKSIYMRQIDQMVGLI 

8 

GGTGGTCGGCGGGGAGGCCCCCGCGCTTTAAAATAATGCCCGCGGCGCCCGCGCGACCATGCAATGGCGAGCGCTCGTCCTGGG 
30 GCTGGTGCTCCTCCGGCTTGGCCTCCATGGAGTATTGTGGCTCGTCTTCGGGCTGGGGCCCAGCATGGGCTTCTACCAGCGCTT 
TCCGCTCAGCTTCGGCTTCCAGCGTCTGAGGAGCCCCGACGGCCCCGCGTCGCCCACCTCGGGGCCCGTGGGCCGGCCTGGGGG 
GGTATCCGGGCCGTCGTGGCTGCAGCCGCCGGGGACCGGGGCAGCGCAGAGCCCGCGCAAGGCTCCGCGGCGTCCTGGGCCGGG 
GATGTGCGGCCCAGCCAACTGGGGCTACGTGCTGGGCGGCCGGGGCCGCGGCCCGGACGAGTACGAGAAGCGCTACAGCGGCGC 
CTTCCCTCCGCAGCTGCGTGCCCAGATGCGCGACCTGGCACGGGGCATGTTCGTCTTTGGCTACGACAACTACATGGCTCACGC 
35 CTTCCCCCAGGACGAGCTCAACCCCATCCACTGCCGCGGCCGTGGGCCCGACCGCGGGGACCCTTCAAATCTGAACATCAATGA 
TGTACTAGGGAACTACTCATTGACTCTTGTTGATGCATTGGATACACTTGCAATAATGGGAAATTCATCCGAGTTCCAGAAAGC 
AGTCAAGTTAGTGATCAACACAGTTTCATTTGACAAAGATTCCACCGTCCAAGTCTTTGAGGCCACGATAAGGGTCCTGGGAAG 
CCTCCTTTCTGCTCACAGAATAATAACTGACTCCAAGCAGCCCTTTGGTGACATGACAATTAAGGACTATGATAATGAGTTGTT 
ATACATGGCCCATGACCTGGCGGTGCGGCTCCTCCCTGCTTTTGAAAACACCAAGACAGGGATTCCATATCCTCGGGTGAATCT 
40 AAAGACAGGAGTTCCTCCTGACACCAATAATGAGACATGCACAGCGGGAGCCGGTTCCCTCCTGGTGGAATTTGGGATTCTGAG 
TCGACTCCTGGGGGACTCCACATTTGAGTGGGTGGCCAGACGAGCAGTGAAAGCCCTTTGGAACCTCCGGAGCAATGATACAGG 
ATTACTAGGCAATGTCGTGAACATTCAGACGGGCCACTGGGTTGGAAAGCAGAGTGGCCTGGGTGCCGGGCTGGACTCCTTCTA 
TGAATACCTCTTGAAATCTTACATTCTCOT 

CTACTTAAGAAGAGGGCGGGAAGCCTGCAATGAAGGAGAAGGAGACCCTCCACTCTATGTCAACGTGAACATGTTCAGTGGGCA 
45 GCTGATGAACACCTGGATTGACTCTCTGCAGGCCTTTTTCCCTGGACTGCAGGTGCTGATAGGAGATGTGGAAGATGCCATCTG 
CCTTCATGCCTTCTACTATGCCATATGGAAACGATATGGTGCCCTCCCTGAGAGATATAACTGGCAGCTGCAGGCCCCTGACGT 
TCTCTTCTACCCACTGAGACCAGAGTTAGTGGAATCCACATATCTCCTCTACCAGGCAACCAAGAATCCCTTCTACCTCCATGT 
AGGAATGGATATTCTGCAGAGTCTGGAAAAGTACACAAAAGTCAAGTGTGGGTACGCCACGCTGCATCACGTCATTGACAAGTC 
CACAGAAGACCGGATGGAGAGCTTCTTTCTCAGTGAGACCTGTAAATATTTGTATCTGCTGTTTGATGAAGACAATCCAGTACA 
50 CAAGTCTGGAACCAGATACATGTTCACAACAGAGGGACACATTGTATCTGTGGATGAGCATCTTCGGGAATTGCCATGGAAGGA 
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ATTCTTCTCTGAAGAGGGAGGGCAGGACCAAGGGGGAAAGTCTGTGCACAGGCCGAAACCTCATGAGTTAAAAGTCATCAACTC 
CAGCTCCAACTGCAATCGTGTACCTGATGAGAGGAGGTACTCCCTGCCCTTAAAGAGCATCTACATGCGACAGATTGACCAGAT 
GGTTGGTTTGATTTGATCTGCTCTCTGTGAGGCCTCATCTTGAACCAGACCTTAACGACCAAACCCAGACCATGCCAAAGTCCA 
GTCTGAAATGAAAGGGGACAGAAGTCTTGCTGTCCATGGTGGTGTAGGAATTTCTGTGCAACACCTCACCACGTCTGGTTAATC 
5 CTTGCACACTTCAGTGTTTCTCTCCTGTTCAATAAAATGCCCTGTTAAGGATATAATTTGAAGTGAGAAGATACATGGAAATTG 
CCCTCTTATGACATGTTGATGTTATAAGCACAATAGATGGGGCATCTTTGGATTGATGTTCACAGCTTTATACTTCAGAACCTA 
AGTCTCTTCACTTTGCTGGCACCTGCTATACTGGAGTATTGCTATGTCTTTAAAAAATTTTTTTTTATTATATTTTATTTTTTT 
GAGACAGGGTCTTGATATTTTTTTGGGACAGGGTTACCTGGGCTCAAGTGATCCTTCTGCCTCAGCCTCCCGAGTAGCTGGGAT 
TACAGGTGAGCACCACTGTACCTGGCTAGCTACTTCTTTGTTAGAGGATTGAGAATGAAATTTCTGCAAAAGGGCCCATGGTTC 

10 ATTTGGTATCCCTATTTAATTGCATTGAAAATGTCATCCTTTCTGTTGTTAGATAATTGGGGTCTTCCCCTGATATCCAACCGT 
GATTTTGGATCACATGGGAGAAAAAGTCATCCAGTTTTTCATGTTTGCCTCAAGTAATCTTTACAGTGTTACAAATTATTTGCT 
TAAGAAGAATGGTCTTAACCAGAATTCTTAACAGATAGTCTCTTAGGTTATTATGTTATGGTCTAAGAGGTTAACTGACATCTT 
TTGGATGGTATTTTGCATTTTGAATATGAACTTACCTGAGGAACTCCCATAGTTCCAGAATCAGGTGCCTTTTAGGGAGAGAAC 
AATACCTAAGATTGTCTGAGCTTCCATCTTTCTCATATTTCCTAAGCAAGGATTCTCACTTATGACCATATTTGGGTTAGAGTT 

15 CTGTTTTGTTTCTGTTTTCTGTGTCTAGTGCCAATTAGCTAAATCAGGGAGAAAGAAATGATCACATGACTTTTAGCATCCTTG 
AGCCATTTCTCTGTGTAATACAGGCTTTAGATTAGTGCCTTATATTGGTTTTGGTTTGGGGCACTGGATGTCGCAGCTACTGCT 
ATGGTTTCAGGAGGCCTGTTTAGCCACATGGTGAGACCGTGGTGAAAGGGGGATGGAAATTGCTTGGCCAGTCTTTGCCTTTCA 

TCTCTTTGAACTCACACTGATACACACCTGCTACTCTTACACAGTGCAGCAGGGCTGACTCTTAGTCTGGCTTCCATGAAGCGT 

20 CATGGGTGGAMCGCATTCTAGTAAAAAAGGTAGGAAATCCCTAAAACTTCCAGCCTCACATAGCACGGTTCTCACCTGTCACT 
GTTTTCCCACCTCTAAGGATTTCATGTACATCTTTTCAAAGCTAGAAATAAGCACTGTCTAAGTTTATGTTGCATTTTTAGTCA 
AAAGGGAGAAATCTTATTCCTTCTTGAAAATTTTAAGTGT.TATGGTTTTATATAGTTCAGTTCTTTGAGATTTTTGAAAAGAGT 
ATTTTCAGTAATAAACGTGCCATCTCTATCTCTTAAACATTTATTACAACAATTGTTTTAAMTAGAAAAAATAAAATGCTTCT 
ATTTTACCTTTTTTCATTTCAGAAGCATTATTCTGTTTATTAACAGTGTCCCATCTACTGAATAGAAAACTTTGAGAATAATAT 

25 ATATATATATTTTAAATGTTTTCACTGACTCATTGAAAATGTTAATTACACACACATGCATGCATGCACACACGAGCATACTTG 
TACCTTTGTCTCTGGGCAAACAGGTGGGACTGTTAGTGACCCATTTGGGAAAATAGAGCATCTCAGAGAAGGAGGTGAGTTCTT 
CCTGCCTGTGATTTCTCTTGGCGCTCCCCTCCTCTCCCGCTCTGGCTTCTGTGGCGGCAGTGGTGGGTAAGCACTCCAGTGTTC 
TCTTAATGAGGCACTTTGCCTGTCACTCGAGCAAGCCTGGGTGTTCCTTCCTCCTCATGCTCCTGGAATAGGGAATAGGGATCT 
CATGCTTGCAAACTACACAATGCTGCAGGTGCTTCCCAGGGGCCACAGGCTGTCAGGAAACGTGTTTTATGTTAAGTCACAAAC 

30 CCACTTGACTTCTGGGTACTGGAATTAATACCAGTGGGTGAGACTGAGGGTGAGTGAGTTAGTACATATTAATCCTGGTTGTTG 
AGCTTCCAGACTACCCCGTCCAAAGTTTGATGCTATGTAGTCAGTGGTTTGTGGGGCTGGATGCCAGAAGGTTCTTTGAGCCAG 
TTTCAAAGGTTACTTGTTTTTTTTTTTTTTTTTTTAAGTCAGAATGTTAACAGCTGTGATATATCCTGCAGGGCTTTTGCAGTT 
TCTTCTGTTCTGTGTTCTGAAATCCTGGGTAGAGAATGGCTGAGGAGGAGATTACCAGAGAAGTTGCTTTGCTCAGTGCTTTGC 
CCCAGGATTGCCTCAAATCTGAGTGGACTTCATCCTTTGCGGCGGGTCTGAGCCTGGCCCATCTTCCTATTCCCACGTGTAGCT 

35 AGTGTCTAGTGTCAGCTTTGCTCAATGTGGTGGAAACATTTTGCAGAACTGTTGTAGAAAGCTGCCTTATAGTTGGCTTGACAA 
AGCATAATTCTCTCATAACAAACTTTCAAATCATTACAGTAGCTTAGCTACTTTAGTTGATGTGACCGAGGAATCCCTTCTAGA 
ATCATAGGTGGCAAGGGAGGGTTTGCTAGCTCTCCATTTGCACTGGCCATTGTGAAAAACCAGCTTCTGTATTCAAATCTTTCC 
TTCATTTTTTTAAATTTTTTTTTTGGCAGCGCTTGTGCTGGAACTTACTCATTGTAACTGAATCCTCAGGGCTTTTCTTGTTTT 
AGATCATGGACTGTGCACGTGACACTTAAATAATTTTCTATGTATTTAAAGAAAAATGCACCAGGATGGTGTCTGTGCACGTGA 

40 CTATTAGAGGAGCGTCTGTAGAAGTACCTGGTTTGGTCAGTGCAGTTGTGCAATCTGAGGGCCTTGTTTCCTCCTCCCCTTTCC 
CCTTCTCCCCACCAAAGGAAAATATCCCTCTTAATGATTTCGTAGTTCAGTTTACTGAATGATTACCACCTGTAATTCCTCTTT 
GGATTGTGTAGACTCAACATGAGACATTCCTTTCTGCTTTCTGGAGGGCACCAGGGGCCTTTCTCTTTGATAAATTTTTTTTGT 
CTGTTGACAAAAACAAAAATCTTTTTTCAAATGTAGTGCTGGTGAAAAGGTAGGGCTGAGTGATTACCTTAGCCACAGGGTGGC 
TGAGCAGGAACTTTAGAAGAAAATCCTGAGCTTTCCTGTCCATTCCCAGCATCCAGCTCCTATTCTAGTGCCTCTTCCCTGCAG 

45 GGCAGGGACCCCTTGGGAAATCGAGGAGGTGGGACGGGCTGGGCCCTGTGTCCCAGGTTTCACAGGGCTCAGGGTTATGCTCCC 
GCTTGAATCTGGACGTGAATCTGGTAAAAATATCAAGTACCTGTGGMCTCCCTGATTCTATACCCTCTTCCTTCTTTCTGCAA 
GGCAGAGGAATAATATTTTTAAAGGTTATTTTGTTTTAGTTTTAAATAGCAAAACACAAGCTGCATTTTTATTTATTTTGCATA 
AGAAAGGTAAATCTTTTTACAAAAAAAAGTATAGAGTTGGAAACTCTGGGAAAACTTACGGAAATACACAAATGCTTCTCTGTA 
ATGTGCMTATGCTTTGCAACTGTAGATGATATTTTATGTTTAATCTGTAAATAAGAAATGTATTTAAATTAAAAGGGATCTTT 

50 TTGTAAAAGGACCAAATGTTCTTTTATAAATGTAATAAGGAATATCTTGCTCTTTAAAATTTATTAGGATTTTTATGAGTAATT 
TTTATTAAAAGATTTCTTTTTTTG 




9 

MACEIMPLQSSQEDERPLSPFYLSAHW 

QSKEQDEVRHGRDKGLINKENTPSGFNHLDDCILNTQEVEKVHKNTFGCAGERSKPKRQKSSTKLSELHDNQDGLWMESLNST 
RSHERTGPDDFEWMSDERKGNEKDGGH^ 
5 PAFSSWQRENSDSDEAHLSPQAGRLIRQLLDEDSDPMLSPRFYAYGQSRQYLDDTEVPPSPPNSHSFMRRRSSSLGSYDDEQED 
LTPAQLTRRIQSLKKKIRKFEDRFEEEKKYRPSHSDKAMPEVLKWTNDLAKFRRQLKESKLKISEEDLTPRMRQRSNTLPKSF 
GSQLEKEDEKKQELVDKAIKPSVEATLESIQRKLQEKRAESSRPEDIKDMTKDQIANEKVALQKALLYYESIHGRPVTKNERQV 
MKPLYDRYRLVKQILSRANTIPIIGSPSSKRRSPLLQPIIEGETASFFKEIKEEEEGSEDDSNVKPDFMVTLKTDFSARCFLDQ 
FEDDADGFISPMDDKIPSKCSQDTGLSNLHAASIPELL^ 
10 YKHIKAKLRLLEVL I SKRDTDSKSM 

10 

AAATGTAGAGAAGCAGCCGATAAAATAGCATTGCCTGAAGAAGTTTGGAGGCTGAGAGCAGCAGTAGACTGGCCAACTQCAGAG 
CAAGTTGTTTCTCCAGCCGTGCGGTGCAGCCTCATGCCCCCAACCCAGCTTAGCCACTGTAAGAAGACGTTCACTGTACAGACG 
ACCAAACTTGCCGTGGAAGAGACAGTTGTGAGATTCCCTTGCAAATTTACATACGAGAATGGCTTGTGAAATCATGCCTCTGCA 

15 AAGTTCACAGGAAGATGAAAGACCTCTGTCACCTTTCTATTTGAGTGCTCATGTACCCCAAGTCAGCAATGTGTCTGCAACCGG 
AGAACTCTTAGAAAGAACCATCCGATCAGCTGTAGAACAACATCTTTTTGATGTTAATAACTCTGGAGGTCAAAGTTCAGAGGA 
CTCAGAATCTGGAACACTATCAGCATCTTCTGCCACATCTGCCAGACAGCGCCGCCGCCAGTCCAAGGAGCAGGATGAAGTTCG 
ACATGGGAGAGACAAGGGACTTATCAACAAAGAAAATACTCCTTCTGGGTTCAACCACCTTGATGATTGTATTTTGAATACTCA 
GGAAGTCGAAAAGGTACACAAAAATACTTTTGGTTGTGCTGGAGAAAGGAGCAAGCCTAAACGTCAGAAATCCAGTACTAAACT 

20 TTCTGAGCTTCATGACAATCAGGACGGTCTTGTGAATATGGAAAGTCTCAATTCCACACGATCTCATGAGAGAACTGGACCTGA 
TGATTTTGAATGGATGTCTGATGAAAGGAAAGGAAATGAAAAAGATGGTGGACACACTCAGCATTTTGAGAGCCCCACAATGAA 
GATCCAGGAGCATCCCAGCCTATCTGACACCAAACAGCAGAGAAATCAAGATGCCGGTGACCAGGAGGAGAGCTTTGTCTCCGA 
AGTGCCCCAGTCGGACCTGACTGCATTGTGTGATGAAAAGAACTGGGAAGAGCCTATCCCTGCTTTCTCCTCCTGGCAGCGGGA 
GAACAGTGACTCTGATGAAGCCCACCTCTCGCCGCAGGCTGGGCGCCTGATCCGTCAGCTGCTGGACGAAGACAGCGACCCCAT 

25 GCTCTCTCCTCGGTTCTACGCTTATGGGCAGAGCAGGCAATACCTGGATGACACAGAAGTGCCTCCTTCCCCACCAAACTCCCA 
TTCTTTCATGAGGCGGCGAAGCTCCTCTCTGGGGTCCTATGATGATGAGCAAGAGGACCTGACACCTGCCCAGCTCACACGAAG 
GATTCAGAGCCTTAAAAAGAAGATCCGGAAGTTTGAAGATAGATTCGAAGAAGAGAAGAAGTACAGACCTTCCCACAGTGACAA 
AGCAGCCAATCCGGAGGTTCTGAAATGGACAAATGACCTTGCCAAATTCCGGAGACAACTTAAAGAATCAAAACTAAAGATATC 
TGAAGAGGACCTAACTCCCAGGATGCGGCAGCGAAGCAACACACTCCCCAAGAGTTTTGGTTCCCAACTTGAGAAAGAAGATGA 

30 GAAGAAGCAAGAGCTGGTGGATAAAGCAATAAAGCCCAGTGTTGAAGCCACATTGGAATCTATTCAGAGGAAGCTCCAGGAGAA 
GCGAGCGGAAAGCAGCCGCCCTGAGGACATTAAGGATATGACCAAAGACCAGATTGCTAATGAGAAAGTGGCTCTGCAGAAAGC 
TCTGTTATATTATGAAAGCATTCATGGACGGCCGGTAACAAAGAACGAACGGCAGGTGATGAAGCCACTATACGACAGGTACCG 
GCTGGTCAAACAGATCCTCTCCCGAGCTAACACCATACCCATCATTGGTTCCCCCTCCAGCAAGCGGAGMGCCCTTTGCTGCA 
GCCAATTATCGAGGGCGAAACTGCTTCCTTCTTCAAGGAGATAAAGGAAGAAGAGGAGGGGTCAGAAGACGATAGCAATGTGAA 

35 GCCAGACTTCATGGTCACTCTGAAAACCGATTTCAGTGCACGATGCTTTCTGGACCAATTCGAAGATGACGCTGATGGATTTAT 
TTCCCCAATGGATGATAAAATACCATCAAAATGCAGCCAGGACACAGGGCTTTCAAATCTCCATGCTGCCTCAATACCTGAACT 
CCTGGAACACCTCCAGGAAATGAGAGAAGAAAAGAAAAGGATTCGAAAGAAACTTCGGGATTTTGAAGACAACTTTTTCAGAC 
GAATGGAAGAAATGTCCAGAAGGAAGACCGCACTCCTATGGCTGAAGAATACAGTGAATATAAGCACATAAAGGCGAAACTGAG 
GCTCCTGGAGGTGCTCATCAGCAAGAGAGACACTGATTCCAAGTCCATGTGAGGGGCATGGCCAAGCACAGGGGGCTGGCAGCT 

40 GCGGTGAGAGTTTACTGTCCCCAGAGAAAGTGCAGCTCTGGAAGGCAGCCTTGGGGCTGGCCCTGCAAAGCATGCAGCCCTTCT 
GCCTCTAGACCATTTGGCATCGGCTCCTGTTTCCATTGCCTGCCTTAGAAACTGGCTGGAAGAAGACAATGTGACCTGACTTAG 
GCATTTTGTAATTGGAAAGTCAAGACTGCAGTATGTGCACATGCGCACGCGCATGCACGCACACACACACACAGTAGTGGAGCT 
TTCCTAACACTAGCAGAGATTAATCACTACATTAGACAACACTCATCTACAGAGAATATACACTGTTCTTCCCTGGATAACTGA 
GAAACAAGAGACCATTCTCTGTCTAACTGTGATAAAAACAAGCTCAGGACTTTATTCTATAGAGCAAACTTGCTGTGGAGGGCC 

45 ATGCTCTCCTTGGACCCAGTTAACTGCAAACGTGCATTGGAGCCCTATTTGCTGCCGCTGCCATTCTAGTGACCTTTCCACAGA 
GCTGCGCCTTCCTCACGTGTGTGAAAGGTTTTCCCCTTCAGCCCTCAGGTAGATGGAAGCTGCATCTGCCCACGATGGCAGTGC 
AGTCATCATCTTCAGGATGTTTCTTCAGGACTTCCTCAGCTGACAAGGAATTTTGGTCCCTGCCTAGGACCGGGTCATCTGCAG 
AGGACAGAGAGATGGTAAGCAGCTGTATGAATGCTGATTTTAAAACCAGGTCATGGGAGAAGAGCCTGGAGATTCTTTCCTGAA 
CACTGACTGCACTTACCAGTCTGATTTTATCGTCAAACACCAAGCCAGGCTAGCATGCTCATGGCAATCTGTTTGGGGCTGTTT 

50 TGTTGTGGCACTAGCCAAACATAAAGGGGCTTAAGTCAGCCTGCATACAGAGGATCGGGGAGAGAAGGGGCCTGTGTTCTCAGC 
CTCCTGAGTACTTACCAGAGTTTAATTTTTTTAAAAAAAATCTGCACTAAAATCCCCAAACTGACAGGTAAATGTAGCCCTCAG 
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AGCTCAGCCCAAGGCAGAATCTAAATCACACTATTTTCGAGATCATGTATAAAAAGAAAAAAAAGAAGTCATGCTGTGTGGCCA 
ATTATAATTTTTTTCAAAGACTTTGTCACAAAACTGTCTATATTAGACATTTTGGAGGGACCAGGAAATGTAAGACACCAAATC 
CTCCATCTCTTCAGTGTGCCTGATGTCACCTCATGATTTGCTGTTACTTTTTTAACTCCTGCGCCAAGGACAGTGGGTTCTGTG 
TCCACCTTTGTGCTTTGCGAGGCCGAGCCCAGGCATCTGCTCGCCTGCCACGGCTGACCAGAGAAGGTGCTTCAGGAGCTCTGC 
5 CTTAGACGACGTGTTACAGTATGA 

AAGGAAAAGAAAAAGAATGTAAAACTATACTGACCCGTTTTCAGTTTTAAAGGGTCGTGAGAAACTGGCTGGTCCAATGGGATT 
TACAGCAACATTTTCCATTGCTGAAGTGAGGTAGCAGCTCTCTTCTGTCAGCTGAATGTTAAGGATGGGGAAAAAGAATGCCTT 
TAAGTTTGCTCTTAATCGTATGGAAGCTTGAGCTATGTGTTGGAAGTGCCCTGGTTAATCCATACACAAAGACGGTACATAATC 
CTACAGGTTTAAATGTACATAAAAATATAGTTTGGAATTCTTTGCTCTACTGTTTACATTGCAGATTGCTATAATTTCAAGGAG 
10 TGAGATTATAAATAAAATGATGCACTTTAGGATGTTTCCTATTTTTGAAATCTGAACATGAATCATTCACATGACCAAAAATTG 
TGTTTTTTTAAAAATACATGTCTAGTCTGTCCTTTAATAGCTCTCTTAAATAAGCTATGATATTAATCAGATCATTACCAGTTA 
GCTTTTAMGCACATTTGTTTAAGACTATGTTTTTGGAAAAATACGCTACAGAATTTTTTTTTAAGCTACAAATAAATGAGATG 
CTACTAATTGTTTTGGAATCTGTTGTTTCTGCCAAAGGTAAATTAACTAAAGATTTATTCAGGAATCCCCATTTGAATTTGTAT 
GATTCAATAAAAGAAAACACCAAGTAAGTTATATAAAAT 

11

MLKGVQRKRADKYWEYTFKVNWSDLSVTTVTKTHQELQEFLLKLPKELSSETFDKTILRALNQGS 
SSSSQAFLQSQKVHSFFQSISSDSLHSINNLQSSLKTSKILEHLKEDSSEASSQEEDVLQHAIIHKKHTGKSPIV^ 
LDGLTMQYSEQNGIVDWRKQSCTTIQHPEHCVTSADQHSAEKRSLSSINKKKGKPQTEKEKIKKTDNRLNSRINGIRLSTPQHA 
HGGTVKDVNLDIGSGHDTCGETSSESYSSPSSPRHDGRESFESEEEKDRDTDSNSEDSGNPSTTRFTGYGSVNQTVTVKPPVQI 

20 ASLGNENGNLLEDPLNSPKYQHISFMPTLHCVMHNGAQKSEVWPAPKPADGKTIGMLVPSPVAISAIRESANSTPVGILGPTA 
CTGESEKHLELLASPLPIPSTFLPHSSTPALHLTVQRLKLPPPQGSSESCTVNIPQQPPGSLSIASPNTAFIPIHNPGSFPGSP 
VATTDPITKSASQWGLNQMVPQIEGNTGTVPQPTNVKWLPAAGLSAAQPPASYPLPGSPLAAGVLPSQNSSVLSTAATSPQP 
ASAGISQAQATVPPAVPTHTPGPAPSPSPALTHSTAQSDSTSYISAVGNTNANGTWPPQQMGSGPCGSCGRRCSCGTNGNLQL 
NSYYYPNPMPGPMYRVPSFFTLPSICNGSYLNQAHQSNGNQLPFFLPQTPYANGLVHDPWGSQANYGMQQMAGFGRF 

25 PNWANTSGSGPKKNGNVSCYNCGVSGHYAQDCKQSSMEANQQGTYRLRYAPPLPPSNDTLDSAD 

12 

ATCGGAAAAGCACCTTGAGTTACTGGCTTCCCCTTTACCTATTCCATCAACCTTCCTTCCACACAGTAGTACTCCCGCTTTGCA 
TCTTACAGTTCAGAGGCTAAAGTTGCCACCACCACAGGGATCTTCTGAGAGCTGCACAGTTAACATCCCACAACAACCACCCGG 
AAGCCTGAGCATCGCATCACCAAACACTGCCTTTATTCCTATCCATAACCGAGGTAGTTTCCCAGGCTCTCCTGTTGCTACCAC 

30 GGACCCCATCACAAAATCTGCATCCCAAGTGGTAGGACTCAATCAAATGGTGCCTCAAATTGAGGGAAACACAGGGACAGTCCC 
TCAGCCTACCAATGTGAAGGTAGTTCTTCCAGCAGCTGGCCTCTCAGCTGCTCAGCCACCAGCTTCCTACCCCTTACCAGGCTC 
TCCCCTTGCTGCCGGCGTGTTACCCAGCCAGAACTCCAGTGTGCTCAGCACAGCAGCAACTTCTCCCCAGCCAGCGAGCGCAGG 
TATCAGCCAGGCCCAGGCAACTGTTCCTCCTGCAGTTCCTACCCACACCCCAGGCCCTGCCCCGAGCCCAAGCCCTGCCTTGAC 
ACACAGTACCGCGCAGAGTGACAGCACCTCTTACATCAGTGCTGTGGGGAACACGAACGCTAATGGGACAGTAGTGCCACCGCA 

GCAGATGGGCTCAGGTCCTTGTGGTTCTTC

TTATCCTAATCCAATGCCTGGACCAATGTACCGAGTCCCTTCATTCTTTACTCTGCCATCCATTTGCAATGGCAGCTACCTCAA 
CCAAGCACATCAGAGCAATGGAAACCAACTTCCTTTTTTTCTGCCTCAGACTCCATATGCAAATGGACTGGTACATGACCCAGT 
CATGGGGAGCCAAGCCAACTATGGCATGCAGCAGATGGCAGGATTTGGGAGATTCTATCCTGTATATCCAGCACCTAACGTAGT 
TGCCAACACCAGTGGTTCGGGGCCCAAGAAGAATGGGAATGTCTCATGTTACAATTGTGGTGTAAGCGGACACTACGCACAGGA 

CTGTAAGCAGTCGTCCATGGAGGCCAATCAACAAGGCACTTACAGACTGAGATACGCACCTCCCCTCCCCCCTTCTAATGATAC
GTTGGATTCTGCAGACTGAAACGAGTAAAGCTTGCCTACTTAATACACTCAAGTGTGGGGAGTCATGGGGTGTGGAGGGGAGGA 
AAGGAAAGGTATTTTGTTTCTTTGTCTATACATT.TCCTAGATTTCTATGCAGTTGGGATTTTTCATTTCTCTTGTACCAATGTC 
CAAAACAAGAAAGAATGCAATGCTTTTGAGCCTCTGGTCTCCTGGTTCAACAACAGGCTTATATGTATGATACATGTAATTTAA 
ACCTTCAGACAAACTTAAATGTTGGTGCGTGCTTTTTTTTTTTTTTACACTGAATACTTGCTGTGTGCAATGTTTACTGAATCT 

CAGCTTTTCTCACTTTCACCCTAAACGACACTTCCTCCCCAGCCAGCCTCACTCTGTCTCCGGCCCGCAGCAGGAGCAGCCAGC 
AGTGCATTCACCCCACTTTTGTAAACTGCTCTGCATATAAACCAAGGGCAGAATGTTTCACCCTGATCTTATGGGAGGAATCGA 
ACTCCCAAAATAGTGTGTATATATGTAATAAACAGCGTCACGTAAATACATATATGCAGTGCTTGTTGTCCAAATAGAAATGAA 
AATAAGTGGAAGAGAGAGGAAGAAGTCAACCATATGAAACTGAAAAMTATGACGTACGAAATGGACAAAAAGCTTTTTCTGAA 
ACCAACTTTTTACTTCCATCATCCTTTTTTAGCCTGTTGCTTCAGAGAGACACAAAGTGAACACACTGGTGTGAATGTCGCTCT
CTGTGTGCTTGTGTTTGTAATGAAAGTCTACAGCCAATTTTACTTGTCTACCACCGTGTTGTGCTCAAAGAGACACTACTTGAG
TGAAGATTTCTTCTTTCCCTGTACCAGCTGTTACAGTGTTACGTTGTGTTTAAAATGTGTATGGTTTATTGCAATCTGAACAGA 
GCTATGGGTTTCTACCATAAGTCAGGTTGTTTGTTCCCTAACCTGTCTCTCGTAGCAAAGTCACTTTTATAACAGTTTACCACT 
ATGCTTGATTATAATGTGAAAGGCGGAATTCTGAGTGTGTTAAGATGGTATTAATCATGTCGGTGTCATGTCACTAAGTTTAAT 
GCTGCTGTTTTTAAAAAAAAAAAAAGTTTTTTTAAAAAGCCAATCTATGTACTAAATTGCTTCCAGGTAATTTTTGATTTCCTA
AAGTGCACTGAGGTTATCTGGAAGATTGGGTGTATTTTTTGGTGACTGCTGCATTCATCAGCAATGAACAGTTTCCACTGTATA 
GTCCTAGGGGTCAGGGGGTGGGGGTTTCATTTTCCATTCCTCAGCACAGAGCAGAAATGATAGATTTTTATTGTTTGGAGTAAC 
GTTGGTATGCAGCAGAGGAACATAAACATTTGGTCTTGGTTCAGAAGCCTAACAGATTGCTAGACAAGAGAAAAAACTTGAAGA 
AAAAAGAAGCTTAATTTCATGCTTCATAAGTAGCATTTATATTTATAGCACCAATGTACATTTTGAAACTTTCTTTCAGGGGTG 

GGAGTTATGGGGAAGGGGTGGGTGTGAAGGGGTAGATGAAAGCTTTAATTTAGAAAGAAAGTTCAAGTAAAGGAAATTATTTTG
ATTAAATATATTTTATTTGATCTGGGTATTTTTGGACCACATTATTAAATTAATTGTTAAGCTGCAGTTGAGTTGTTCAAGTGA 
GAGTTTTGATAAGCCACTTATGGGCCGCGTTGTGAATCACTTGCCAGTTGTACTTTATGGAGCTTATTTTATGATTTAAAATAC 
TGTACTGTACATAGGAGGTATGTTACCTTCTCCTTATTTGTATGTTTACCATATACTTTGATATTTGAAATGTTATGTACTGGA 
AAGGCCACTTATATTTCTAGAACAGATTGGATTTTATGCMCCTTTTTTCCTTGMTTAACAGCAATAAAAAAATGAAAM 

AAAAAAAAAAAAAAAAAAA

13 

RIFGASPPRNLAIQKCASRTAAMGSEDHGAQNPSCKIMTF^ 

DIDDWIPAPIQQWTGQSGLFTQYNIQKKAMTVGEYRRLANSEKYCTPRHQDFDDLERKYWKNLTFVSPIYGADISGSLYDDD 
VAQMIGSLRTILDMVERECGTIIEGWTPYLYFGMKTTFAWHTEDMDLYSINYLHFGEPKSWYAIPPEHGKRLERLAIGFFP 

GSSQGCDAFLRHKMTLISPIILKKYGIPFSRITQEAGEFMITFPYGYHAGFNHGFNCAESTNFATLRWIDYGKVATQCTCRKDM
VKISMDVFVRILQPERYELWKQGKDLTVLDHTRPTALTSPELSSWSASRASLKAKLLRRSHRKRSQPKKPKPEDPKFPGEGTAG 
AALLEEAGGSVKEEAGPEVDPEEEEEEPQPLPHGREAEGAEEDGRGKLRPTKAKSERKKKSFGLLPPQLPPPPAHFPSEEALWL 
PSPLEPPVLGPGPAAMEESPLPAPLNWPPEVPSEELEAKPRPIIPMLYWPRPGKMFNQEHVSCQQAFEHFAQKGPTWKEPV 
SPMELTGPEDGAAS SGAGRMETKARAGEGQAP STF SKLKMEIKKSRRHPLGRPPTRSPLSVVKQEAS SDEEASPF SGEEDVSDP 

DALRPLLSLQWKNRAASFQAERKFNAAAARTEPYCAICTLFYPYCQALQTEKEAPIASLGEGCPATLPSKSRQKTRPLIPEMCF
TSGGENTEPLPANSYIGDDGTSPLIACGKCCLQVHASCYGIRPELVNEGWTCSRCAAHAWTAECCLCNLRGGALQMTTDRRWIH 
VICAIAVPEARFLWIERHPVDISAIPEQRWKLKCWCRKRMKKVSGACIQCSYEHCSTSFHVTCAHMGVLMEPDDWPYWSI 
TCLKHKSGGHAVQLLRAVSLGQWITKNRNGLYYRCRVIGAASQTCYEWFDDGSYSDNLYPESITSRDCVQLGPPSEGELVEL 
RWTDGNLYKAKFISSVTSHIYQVEFEDGSQLTVKRGDIFTLEEELPKRVRSRLSLSTGAPQEPAFSGEEAKAAKRPRVGTPLAT 

EDSGRSQDYVAFVESLLQVQGRPGAPF

14 

GTCGCCAGCAACCGAGCGGGGCACGCCCGAGCGGGGCCTGGGGGTGCGAGCCGAGGGCGGGGGAGAGCGCGCCGCTGCTCCCGG 
ACCGGGCCGCGCACGCCGCCTCAGGAACCATCACTGTTGCTGGAGGCACCTGACAAATCCTAGCGAATTTTTGGAGCATCTCCA 
CCCAGGAACCTCGCCATCCAGAAGTGTGCTTCCCGCACAGCTGCAGCCATGGGGTCTGAGGACCACGGCGCCCAGAACCCCAGC 

TGTAAAATCATGACGTTTCGCCCAACCATGGAAGAATTTAAAGACTTCAACAAATACGTGGCCTACATAGAGTCGCAGGGAGCC
CACCGGGCGGGCCTGGCCAAGATCATCCCCCCGAAGGAGTGGAAGCCGCGGCAGACGTATGATGACATCGACGACGTGGTGATC 
CCGGCGCCCATCCAGCAGGTGGTGACGGGCCAGTCGGGCCTCTTCACGCAGTACAATATCCAGAAGAAGGCCATGACAGTGGGC 
GAGTACCGCCGCCTGGCCAACAGCGAGAAGTACTGTACCCCGCGGCACCAGGACTTTGATGACCTTGAACGCAAATACTGGAAG 
AACCTCACCTTTGTCTCCCCGATCTACGGGGCTGACATCAGCGGCTCTTTGTATGATGACGACGTGGCCCAGTGGAACATCGGG 

AGCCTCCGGACCATCCTGGACATGGTGGAGCGCGAGTGCGGCACCATCATCGAGGGCGTGAACACGCCCTACCTGTACTTCGGC
ATGTGGAAGACCACCTTCGCCTGGCACACCGAGGACATGGACCTGTACAGCATCAACTACCTGCACTTTGGGGAGCCTAAGTCC 
TGGTACGCCATCCCACCAGAGCACGGCAAGCGCCTGGAGCGGCTGGCCATCGGCTTCTTCCCCGGGAGCTCGCAGGGCTGCGAC 
GCCTTCCTGCGGCATAAGATGACCCTCATCTCGCCCATCATCCTGAAGAAGTACGGGATCCCCTTCAGCCGGATCACGCAGGAG 
GCCGGGGAATTCATGATCACATTTCCCTACGGCTACCACGCCGGCTTCAATCACGGGTTCAACTGCGCAGAATCTACCAACTTC 

GCCACCCTGCGGTGGATTGACTACGGCAAAGTGGCCACTCAGTGCACGTGCCGGAAGGACATGGTCAAGATCTCCATGGACGTG
TTCGTGCGCATCCTGCAGCCCGAGCGCTACGAGCTGTGGAAGCAGGGCAAGGACCTCACGGTGCTGGACCACACGCGGCCCACG 
GCGCTCACCAGCCCCGAGCTGAGCTCCTGGAGTGCATCCCGGGCCTCGCTGAAGGCCAAGCTCCTCCGCAGGTCTCACCGGAAA 
CGGAGCCAGCCCAAGAAGCCGAAGCCCGAAGACCCCAAGTTCCCTGGGGAGGGTACGGCTGGGGCAGCGCTCCTAGAGGAGGCT 
GGGGGCAGCGTGAAGGAGGAGGCTGGGCCGGAGGTTGACCCCGAGGAGGAGGAGGAGGAGCCGCAGCCACTGCCACACGGCCGG 

GAGGCCGAGGGCGCAGAAGAGGACGGGAGGGGCAAGCTGCGGCCAACCAAGGCCAAGAGCGAGCGGAAGAAGAAGAGCTTCGGC
CTGCTGCCCCCACAGCTGCCGCCCCCGCCTGCTCACTTCCCCTCAGAGGAGGCGCTGTGGCTGCCATCCCCACTGGAGCCCCCG
GTGCTGGGCCCAGGCCCTGCAGCCATGGAGGAGAGCCCCCTGCCGGCACCCCTTAATGTCGTGCCCCCTGAGGTGCCCAGTGAG 
GAGCTAGAGGCCAAGCCTCGGCCCATCATCCCCATGCTGTACGTGGTGCCGCGGCCGGGCAAGGCAGCCTTCAACCAGGAGCAC 
GTGTCCTGCCAGCAGGCCTTTGAGCACTTTGCCCAGAAGGGTCCGACCTGGAAGGAACCAGTTTCCCCCATGGAGCTGACGGGG 
CCAGAGGACGGTGCAGCCAGCAGTGGGGCAGGTCGCATGGAGACCAAAGCCCGGGCCGGAGAGGGGCAGGCACCGTCCACATTT
TCCAAATTGAAGATGGAGATCAAGAAGAGCCGGCGCCATCCCCTGGGCCGGCCGCCCACCCGGTCCCCACTGTCGGTGGTGAAG 
CAGGAGGCCTCAAGTGACGAGGAGGCATCCCCTTTCTCCGGGGAGGAAGATGTGAGTGACCCGGACGCCTTGAGGCCGCTGCTG 
TCTCTGCAGTGGAAGAACAGGGCGGCCAGCTTCCAGGCCGAGAGGAAGTTCAACGCAGCGGCTGCGCGCACGGAGCCCTACTGC 
GCCATCTGCACGCTCTTCTACCCCTACTGCCAGGCCCTACAGACTGAGAAGGAGGCACCCATAGCCTCCCTCGGAAAGGGCTGC 

CCGGCCACATTACCCTCCAAAAGCCGTCAGAAGACCCGACCGCTCATCCCTGAGATGTGCTTCACCTCTGGCGGTGAGAACACG
GAGCCGCTGCCTGCCAACTCCTACATCGGCGACGACGGGACCAGCCCCCTGATCGCCTGCGGCAAGTGCTGCCTGCAGGTCCAT 
GCCAGTTGGTATGGCATCCGTCCCGAGCTGGTCAATGAAGGCTGGACGTGTTCCCGGTGCGCGGCCCACGCCTGGACTGCGGAG 
TGCTGCCTGTGCAACCTGCGAGGAGGTGCGCTGCAGATGACCACCGATAGGAGGTGGATCCACGTGATCTGTGCCATCGCAGTC 
CCCGAGGCGCGCTTCCTGAACGTGATTGAGCGCCACCCTGTGGACATCAGCGCCATCCCCGAGCAGCGGTGGAAGCTGAAATGC 

GTGTACTGCCGGAAGCGGATGAAGAAGGTGTCAGGTGCCTGTATCCAGTGCTCCTACGAGCACTGCTCCACGTCCTTCCACGTG
ACCTGCGCCCACGCCGCAGGCGTGCTCATGGAGCCGGACGACTGGCCCTATGTGGTCTCCATCACCTGCCTCAAGCACAAGTCG 
GGGGGTCACGCTGTCCAACTCCTGAGGGCCGTGTCCCTAGGCCAGGTGGTCATCACCAAGAACCGCAACGGGCTGTACTACCGC 
TGTCGCGTCATCGGTGCCGCCTCGCAGACCTGCTACGAAGTGAACTTCGACGATGGCTCCTACAGCGACAACCTGTACCCTGAG 
AGCATCACGAGTAGGGACTGTGTCCAGCTGGGACCCCCTTCCGAGGGGGAGCTGGTGGAGCTCCGGTGGACTGACGGCAACCTC 

TACAAGGCCAAGTTCATCTCCTCCGTCACCAGCCACATCTACCAGGTGGAGTTTGAGGACGGGTCCCAGGTGACGGTGAAGCGT
GGGGACATCTTCACCCTGGAGGAGGAGCTGCCCAAGAGGGTCCGCTCTCGGCTGTCACTGAGCACGGGGGCACCGCAGGAGCCC 
GCCTTCTCGGGGGAGGAGGCCAAGGCCGCCAAGCGCCCGCGTGTGGGCACCCCGCTTGCCACGGAGGACTCCGGGCGGAGCCAG 
GACTACGTGGCCTTCGTGGAGAGCCTCCTGCAGGTGCAGGGCCGGCCCGGAGCCCCCTTCTAGGACAGCTGGCCGCTCAGGCGA 
CCCTCAGCCCGGCGGGGAGGCCATGGCATGCCCCGGGCGTTCGCTTGCTGTGAATTCCTGTCCTCGTGTCCCCGACCCCCGAGA 

GGCCACCTCCAAGCCGCGGGTGCCCCCTAGGGCGACAGGAGCCAGCGGGACGCCGCACGCGGCCCCAGACTCAGGGAGCAGGGC
CAGGCGGGCTCGGGGGCCGGCCAGGGGAGCACCCCACTCAACTACTCAGAATTTTAAACCATGTAAGCTCTCTTCTTCTCGAAA 
AGGTGCTACTGCAATGCCCTACTGAGCAACCTTTGAGATTGTCACTTCTGTACATAAACCACCTTTGTGAGGCTCTTTCTATAA 
ATACATATTGTTTAAAAAAAAGCAAGAAAAAAAGGAAAACAAAGGA 

AGAAAAACAAAACAAAACAAACACATTGTTTTTCTCAGAACCAGGATTCTCTGAGAGGTCAGAGCATCTCGCTGTTTTTTTGTT 

GTTGTTTTAAAATATTATGATTTGGCTACAGACCAGGCAGGGAAAGAGACCCGGTAATTGGAGGGTGAGCCTCGGGGGGGGGGC
AGGACGCCCCGGTTTCGGCACAGCCCGGTCACTCACGGCCTCGCTCTCGCCTCACCCCGGCTCCTGGGCTTTGATGGTCTGGTG 
CCAGTGCCTGTGCCCACTCTGTGCCTGCTGGGAGGAGGCCCAGGCTCTCTGGTGGCCGCCCCTGTGCACCTGGCCAGGGGAAGC 
CCGGGGGTCTGGGGCCTCCCTCCGTCTGCGCCCACCTTTGCAGAATAAACTCTCTCCTGGGGTTTGTCTATCTTTGTTTCTCTC 
ACCTGAGAGAAACGCAGGTGTTCCAGAGGCTTCCTTGCAGACAAAGCACCCCTGCACCTCCTATGGCTCAGGATGAGGGAGGCC 

CCCAGGCCCTTCTGGTTGGTAGTGAGTGTGGACAGCTTCCCAGCTCTTCGGGTACAACCCTGAGCAGGTCGGGGGACACAGGGC
CGAGGCAGGCCTTCGGGGCCCCTTTCGCCTGCTTCCGGGCAGGGACGAGGCCTGGTGTCCTCGCTCCACCCACCCACGCTGCTG 
TCACCTGAGGGGAATCTGCTTCTTAGGAGTGGGTTGAGCTGATAGAGAAAAAACGGCCTTCAGCCCAGGCTGGGAAGCGCCTTC 
TCCAGGTGCCTCTCCCTCACCAGCTCTGCACCCCTCTGGGGAGCCTTCCCCACCTTAGCTGTCTCCTGCCCCAGGGAGGGATGG 
AGGAGATAATTTGCTTATATTAAAAACAAAAAATGGCTGAGGCAGGAGTTTGGGACCAGCCTGGGCTATATAGCAAGACCCCAT 

CACTACAAATTTTTTACAAATTAGCTAGGTGTGGTGGTGCGCACCTGTGGTCCCAGCTACTCGGGAGGCTGTGGTGGGAGGATT
GCTTGAGTCCAGGAGGTTGAGGCTGCAGTCAGCTCAGATTGCACCACTGCACTCCAGCCTGGGCAACAGAGCGAGACCCTGTCT 
CCAAAAAAAAAAAAAAGCAATGTTTATATTATAAAAGAGTGTCCTAACAGTCCCCGGGCTAGAGAGGACTAAGGAAAACAGAGA 
GAGTGTTACGCAGGAGCAAGCCTTTCATTTCCTTGGTGGGGGAGGGGGGCGGTTGCCCTGGAGAGGGCCGGGGTCGGGGAGGTT 
GGGGGGTGTCAGCCAAAACGTGGAGGTGTCCCTCTGCACGCAGCCCTCGCCCGGCGTGGCGCTGACACTGTATTCTTATGTTGT 

TTGAAAATGCTATTTATATTGTAAAGAAGCGGGCGGGTGCCCCTGCTGCCCTTGTCCCTTGGGGGTCACACCCATCCCCTGGTG
GGCTCCTGGGCGGCCTGCGCAGATGGGCCACAGAAGGGCAGGCCGGAGCTGCACACTCTCCCCACGAAGGTATCTCTGTGTCTT 
ACTCTGTGCAAAGACGCGGCAAAACCCAGTGCCCTGGTTTTTCCCCACCCGAGATGAAGGATACGCTGTATTTTTTGCCTAATG 
TCCCTGCCTCTAGGTTCATAATGAATTAAAGGTTCATGAACGCTGCG 

15 

MTTTVATDYDNIEIQQQYSDVMRWDVDDWDNENSSARLFERSRIKALADEREAVQKKTFTKWVNSHLARVSCRIT
GRMLIKLLEVLSGERLPKPTKGRMRIHCLEISrTOKALQFLKEQRVHLEMGSHDIVD
NKEKRSAKDALLLWCQMKTAGYPIWNi™^

PEDISVDHPDEKSIITYWTYYHYFSKMKALAV^ 

VQQQLQAFNTYRTVEKPPKFTEKGNLEVLLFTIQSKM^ 

EQLARRFDRKAAMRETWLSENQRLVSQDNFGFDLPAVEAATKKHEAIETDIAAYEERVQAWAVARELEAENYHDIKRITARKD 
WIRLWEYLLELLRARRQRLEMNLGLQKIFQEMLYIMDMDEMKVLVLSQDYGKHLLGVEDLLQKHTLVEADIGIQAERW
ASAQKFATDGEGYKPCDPQVIRDRVAHMEFCYQELCQLMERRARLEESRRLWKFFWEMAEEEGWIREKEKILSSDDYGKDLTS 
VMRLLSKHRAFEDEMSGRSGHFEQAIKEGEDMIAEEHFGSEKIRERIIYIREQWMLEQLSAIRKKRLEEASLLHQFQADADDI 
DAWMLDILKIVSSSDVGHDEYSTQSLVKKHKDVAEEIANYRPTLDTLHEQASALPQEHAESPDVRGRLSGIEERYKEVAELTRL 
RKQALQDTLALYKMFSEADACELWIDEKEQWLNmQIPEKLEDLEVIQHRFESLEPEMM 
EIKAQQDKLNTRWSQFRELVDRKKDALLSALSIQNYHLECNETKSWIREKTKVIESTQDLGNDLAGVMALQRKLTGMERDLVAI
EAKLSDLQKEAEKLESEHPDQAQAILSRLAEISDWEEMKTTLKNREASLGEASKLQQFLRDLDDFQSWLSRTQTAIASEDMPN 
TLTEAEKLLTQHENIKNEIDNYEEDYQKMRD^^ 

DTKQAEAFLMQEWLAHTEMPTTLEGAEAAIKKQEDFMTTMDANEEKINAWETGRRLVSDGNINSDRIQEKVDSIDDRHRKN 
RETASELLMRLKDNRDLQKFLQDCQELSLWINEKMLTAQDMSYDEAMLHSKWLKHQAFMAELASNKEWLDKIEKEGMQLISEK 

PETEAWKEKLTGLHKMWEVLESTTQTKAQRLFDMKAELFTQSCADLDKWLHGLESQIQSDDYGKHLTSWILLKKQQMLENQ
MEVRKKEIEELQSQAQALSQEGKSTDEVDSKRLTVQTKFMELLEPLNERKHNLLASKEIHQFNRDVEDEILWVGERMPLATSTD 
HGHNLQTVQLLIKKNQTLQKEIQGHQPRIDDIFERSQNIVTDSSSLSAEAIRQRLADLKQLWGLLIEETEKRHRRLEEAHRAQQ 
YYFDAAEAEAWMSEQELYMSEEKAKDEQSAVSMLKKHQILEQAVEDYAETVHQLSKTSRALVA

AGLKDLAEERRGKLDERHRLFQLNREVDDLEQWIAEREWAGSHELGQDYEHVTMLQERFREFARDTGNIGQERVDTVNHLADE 

LINSGHSDAATIAEWKDGLNEAWADLLELIDTRTQILAASYELHKFYHDAKEIFGRIQDKHKKLPEELGRDQNTVETLQRMHTT
FEHDIQALGTQWQLQEDAARLQAAYAGDKADDIQKRENEVLEAWKSLLDACESRRVRLVDTGDKFRFFSMVRDLMLWMEDVIR 
QIEAQEKPRDVSSVELLMMHQGIKAEIDARNDSFTTCIELGKSLLARKHYASEEIKEKLLQLTEKRKEMIDKWEDRWEWLRLI 
LEVHQFSRDASVAEAWLLGQEPYLSSREIGQSVDEVEKLIKRHEAFEKSAATWDERFSALERLTTLELLEVRRQQEEEERKRRP 
PSPEPSTKVSEEAESQQQWDTSKGEQVSQNGLPAEQGSPRMAETVDTSEMVNGATEQRTSSKESSPIPSP 

AATLPARTQETPSAQMEGFLNRKHEWEAHNKKASSRSWH1WYCVIMQEMGFYKDAKTAASGIPYHSEVPVSLKEAVCEVALDY
KKKKHVFKLRLNDGNEYLFQAKDDEEMTWIQAISSAISSDKHEVSASTQSTPASSRAQTLPTSWTITSESSPGKREKDKEKD 
KEKRFSLFGKKK 

16 

GTGAGCTGAAGCAGGGCAGGGCATCAACTCACCCAGGAAGTGCAAGGGGTTTGGGGATTTTCCTTTCCTAGCCAAGGGAAGGCA 

TGACAGACTGTACCTGGAAAAACAGGACACTCTTGCCCAAATACTGCACTTTTTGCACAGTCTTAGCAACTGGCAGACCAGGAG
ATTCTCTCCTGTGCCTGATTCATTGGGTCCCACACCCATAGGGCCTTGCTTACTGCCAGTGCAGCAGTCTGAGATTAACACCCC 
ATCCCCGGGAGAACTCTAAGAAGGAGCTGATGTGGAGGAGCAGCTGAGACAGTTCAAGATGACGACCACAGTAGCCACAGACTA 
TGACAACATTGAGATCCAGCAGCAGTACAGTGATGTCAACAACCGCTGGGATGTCGACGACTGGGACAATGAGAACAGCTCTGC 
GCGGCTTTTTGAGCGGTCCCGCATCAAGGCTCTGGCAGATGAGCGTGAAGCCGTGCAGAAGAAGACCTTCACCAAGTGGGTCAA 

TTCCCACCTTGCCCGTGTGTCCTGCCGGATCACAGACCTGTACACTGACCTTCGAGATGGACGGATGCTCATCAAGCTGCTGGA
GGTCCTCTCTGGAGAGAGGCTGCCTAAACCCACCAAGGGACGAATGCGCATCCACTGCTTAGAGAATGTGGACAAGGCCCTTCA 
GTTCCTGAAGGAGCAGAGAGTCCATCTTGAGAACATGGGGTCCCATGACATCGTGGATGGAAACCACCGGCTGACCCTTGGCCT 
CATCTGGACCATCATCCTGCGCTTCCAGATCCAGGATATCAGTGTGGAAACTGAAGACAACAAAGAGAAGAAATCTGCCAAGGA 
TGCATTGCTGTTGTGGTGCCAGATGAAGACAGCTGGGTACCCCAATGTCAACATTCACAATTTCACCACTAGCTGGAGGGACGG 

CATGGCCTTCAATGCACTGATACACAAACACCGGCCTGACCTGATAGATTTTGACAAACTAAAGAAATCTAACGCACACTACAA
CCTGCAGAATGCATTTAATCTGGCAGAACAGCACCTCGGCCTCACTAAACTGTTGGACCCCGAAGACATCAGCGTGGACCATCC 
TGATGAGAAGTCCATAATCACTTATGTGGTGACTTATTACCACTACTTCTCTAAGATGAAGGCCTTAGCTGTTGAAGGAAAACG 
AATTGGAAAGGTGCTTGACAATGCTATTGAAACAGAAAAAATGATTGAAAAGTATGAATCACTTGCCTCTGACCTTCTGGAATG 
GATTGAACAAACCATCATCATTCTGAACAATCGCAAATTTGCCAATTCACTGGTCGGGGTTCAACAGCAGCTTCAGGCATTCAA 

CACTTACCGCACTGTGGAGAAACCACCCAAATTTACTGAGAAGGGGAACTTGGAAGTGCTGCTCTTCACCATTCAGAGCAAGAT
GAGGGCCAACAACCAGAAGGTCTACATGCCCCGGGAGGGGAAGCTCATCTCTGACATCAACAAGGCCTGGGAAAGACTGGAAAA 
AGCGGAACACGAAAGAGAACTGGCTTTGCGGAATGAGCTCATAAGACAGGAGAAACTGGAACAGCTCGCCCGCAGATTTGATCG 
CAAGGCAGCTATGAGGGAGACTTGGCTGAGCGAAAACCAGCGTCTGGTGTCTCAGGACAACTTTGGGTTTGACCTTCCTGCAGT 
TGAGGCCGCCACAAAAAAGCACGAGGCCATTGAGACAGACATTGCCGCATACGAGGAGCGTGTGCAGGCTGTGGTAGCCGTGGC 

CAGGGAGCTCGAGGCCGAGAATTACCACGACATCAAGCGCATCACAGCGAGGAAGGACAATGTCATCCGGCTCTGGGAATACCT
ACTGGAACTGCTCAGGGCCCGGAGACAGCGGCTCGAGATGAACCTGGGGCTGCAGAAGATATTCCAGGAAATGCTCTACATTAT
GGACTGGATGGATGAAATGAAGGTGCTAGTATTGTCTCAAGACTATGGCAAACACTTACTTGGTGTGGAAGACCTGTTACAGAA
GCACACCCTGGTTGAAGCAGACATTGGCATCCAGGCAGAGCGGGTGAGAGGTGTCAATGCCTCCGCCCAGAAGTTCGCAACAGA 
CGGGGAAGGTTACAAGCCCTGTGACCCCCAGGTGATCCGAGACCGCGTGGCCCACATGGAGTTCTGTTATGAAGAGCTTTGCCA 
GCTGGCGGCTGAGCGCAGGGCCCGTCTGGAAGAGTCCCGCCGCCTCTGGAAGTTCTTCTGGGAGATGGCAGAAGAGGAAGGCTG 
GATACGGGAGAAGGAGAAGATCCTGTCCTCGGACGATTACGGGAAAGACCTGACCAGCGTCATGCGCCTGCTCAGCAAGCACCG
GGCGTTCGAGGACGAGATGAGCGGCCGCAGTGGCCACTTTGAGCAGGCCATCAAGGAAGGCGAAGACATGATCGCGGAGGAGCA 
CTTCGGGTCGGAGAAGATCCGTGAGAGGATCATTTACATCCGGGAGCAGTGGGCCAACCTAGAGCAGCTCTCGGCCATTCGGAA 
GAAGCGCCTGGAGGAGGCCTCCCTGCTGCACCAGTTCCAGGCAGATGCTGATGACATTGATGCCTGGATGCTGGACATCCTCAA 
GATTGTCTCCAGCAGCGACGTGGGCCACGATGAGTATTCCACACAGTCTCTGGTCAAGAAACACAAGGACGTGGCGGAAGAGAT 

CGCCAATTACAGGCCCACCCTTGACACGCTGCACGAACAAGCCAGCGCCCTCCCCCAGGAGCATGCCGAGTCTCCAGACGTGAG
GGGCAGGCTGTCGGGCATCGAGGAGCGGTATAAGGAGGTGGCAGAGCTGACGCGGCTGCGGAAGCAGGCACTCCAGGACACTCT 
GGCCCTGTACAAGATGTTCAGCGAGGCTGATGCCTGTGAGCTCTGGATCGACGAGAAGGAGCAGTGGCTCAACAACATGCAGAT 
CCCAGAGAAGCTGGAGGATCTGGAGGTCATCCAGCACAGATTTGAGAGCCTAGAACCAGAAATGAACAACCAGGCTTCCCGGGT 
TGCAGTGGTGAACCAGATTGCACGCCAGCTGATGCACAGCGGCCACCCAAGTGAGAAGGAAATCAAAGCCCAGCAGGACAAACT 

CAACACAAGGTGGAGCCAGTTCAGAGAACTGGTTGACAGGAAGAAGGATGCCCTCCTGTCTGCCCTGAGCATCCAGAACTACCA
CCTCGAGTGCAATGAAACCAAATCCTGGATTCGGGAAAAGACCAAGGTCATCGAGTCCACCCAGGACCTGGGCAATGACCTGGC 
TGGCGTCATGGCCCTGCAGCGCAAGCTGACCGGCATGGAGCGGGACTTGGTGGCCATTGAGGCAAAGCTGAGTGACCTGCAGAA 
GGAGGCGGAGAAGCTGGAGTCCGAGCACCCCGACCAGGCCCAGGCCATCCTGTCTCGGCTGGCCGAGATCAGCGACGTGTGGGA 
GGAGATGAAGACCACCCTGAAAAACCGAGAGGCCTCCCTGGGAGAGGCCAGCAAGCTGCAGCAGTTCCTACGGGACTTGGACGA 

CTTCCAGTCCTGGCTCTCTAGGACCCAGACAGCGATCGCCTCGGAGGACATGCCAAACACCCTGACCGAGGCTGAGAAGCTGCT
CACGCAGCACGAGAACATCAAGAATGAGATCGACAACTACGAGGAGGACTACCAGAAGATGAGGGACATGGGCGAGATGGTCAC 
CCAGGGGCAGACCGATGCCCAGTACATGTTTCTGCGGCAGCGGCTGCAGGCCCTGGACACTGGATGGAACGAGCTCCACAAGAT 
GTGGGAGAACAGACAAAATCTCCTATCCCAGTCACATGCCTACCAGCAGTTCCTCAGAGACACGAAGCAAGCCGAAGCCTTTCT 
TAACAACCAGGAGTATGTTCTGGCTCACACTGAAATGCCTACCACCTTGGAAGGAGCTGAAGCAGCAATTAAAAAGCAAGAGGA 

CTTCATGACCACCATGGACGCCAATGAGGAGAAGATCAATGCTGTGGTGGAGACTGGCCGGAGGCTGGTGAGCGATGGGAACAT
CAACTCAGATCGCATCCAGGAGAAGGTGGACTCTATTGATGACAGACATAGGAAGAATCGTGAGACAGCCAGTGAACTTTTGAT 
GAGGTTGAAGGACAACAGGGATCTACAGAAATTCCTGCAAGATTGTCAAGAGCTGTCTCTCTGGATCAATGAGAAGATGCTCAC 
AGCCCAGGACATGTCTTACGATGAAGCCAGAAATCTGCACAGTAAATGGTTGAAGCATCAAGCATTTATGGCAGAACTTGCATC 
CAACAAAGMTGGCTTGACAAAATCGAGAAGGAAGGAATGCAGCTCATTTCAGAAAAGCCTGAGACGGAAGCTGTGGTGAAGGA 

GAAACTCACTGGTTTACATAAAATGTGGGAAGTCCTTGAATCCACTACCCAGACAAAGGCCCAGCGGCTCTTTGATGCAAACAA
GGCCGAACTTTTCACCCAGAGCTGTGCAGATCTAGACAAATGGCTGCACGGCCTGGAGAGTCAGATTCAGTCTGATGACTATGG 
CAAACACCTGACCAGTGTCAATATCCTGCTGAAAAAGCAACAGATGCTGGAGAATCAGATGGAAGTGCGGAAGAAGGAGATCGA 
AGAGCTCCAAAGCCAAGCCCAGGCCCTGAGTCAGGAAGGGAAGAGCACCGACGAGGTAGACAGCAAGCGCCTCACCGTGCAGAC 
CAAGTTCATGGAGTTGGTGGAGCCCTTGAACGAGAGGAAGCATAACCTGCTGGCCTCCAAAGAGATCCATCAGTTCAACAGGGA 

TGTGGAGGACGAGATCTTGTGGGTTGGAGAGAGGATGCCTTTGGCAACTTCCACGGATCATGGCCACAACCTCCAGACTGTGCA
GCTGTTAATAAAGAAAAATCAGACCCTCCAGAAAGAAATCCAGGGGCACCAGCCTCGCATTGACGACATCTTTGAGAGGAGCCA 
AAACATCGTCACTGACAGCAGCAGCCTCAGCGCTGAGGCCATCAGACAGAGGCTTGCCGACCTGAAGCAGCTGTGGGGTCTCCT 
CATTGAGGAGACAGAGAAACGCCACAGGCGGCTGGAGGAGGCGCACAGGGCCCAGCAGTACTACTTTGACGCTGCTGAGGCCGA 
AGCCTGGATGAGCGAGCAGGAGCTGTACATGATGTCAGAGGAGAAGGCCAAGGATGAGCAGAGTGCTGTCTCCATGTTGAAGAA 

GCACCAGATCTTAGAACAAGCTGTGGAGGACTATGCAGAGACCGTGCATCAGCTCTCCAAGACCAGCCGGGCCCTGGTGGCCGA
CAGCCATCCTGAAAGTGAGCGCATTAGCATGCGGCAGTCCAAAGTGGATAAACTGTACGCTGGTCTGAAAGACCTTGCTGAAGA 
GAGAAGAGGCAAGCTGGATGAGAGACACAGGTTATTCCAGCTCAACCGGGAGGTGGACGACCTGGAGCAGTGGATCGCTGAGAG 
GGAGGTGGTCGCAGGGTCCCATGAACTGGGACAGGACTATGAGCATGTCACGATGTTACAAGAACGATTCCGGGAGTTTGCCCG 
AGACACCGGGAACATTGGGCAGGAGCGCGTGGACACGGTCAATCACCTGGCAGATGAGCTCATCAACTCTGGACATTCAGATGC 

CGCCACCATCGCTGAATGGAAGGATGGCCTCAATGAAGCCTGGGCCGACCTCCTGGAGCTCATTGACACAAGAACACAGATTCT
TGCCGCTTCCTATGAACTGCACAAGTTTTACCACGATGCCAAGGAGATCTTTGGGCGTATACAGGACAAACACAAGAAACTCCC 
TGAGGAGCTTGGGAGAGATCAGAACACAGTGGAGACCTTACAGAGAATGCACACTACATTTGAGCATGACATCCAGGCTCTGGG 
CACACAGGTGAGGCAGCTGCAGGAGGATGCAGCCCGCCTCCAGGCGGCCTATGCGGGTGACAAGGCCGACGATATCCAGAAGCG 
CGAGAACGAGGTCCTGGAAGCCTGGAAGTCCCTCCTGGACGCCTGTGAGAGCCGCAGGGTGCGGCTGGTGGACACAGGGGACAA 

GTTCCGCTTCTTCAGCATGGTGCGCGACCTCATGCTCTGGATGGAGGATGTCATCCGGCAGATCGAGGCCCAGGAGAAGCCAAG
GGATGTATCATCTGTTGAACTOTTAATGAATAATCATCAAGGCATCAAAGCTGAAATTGATGCACGTAATGACAGTTTCACAAC 
CTGCATTGAACTTGGGAAATCCCTGTTGGCGAGAAAACACTATGCATCTGAGGAGATCAAGGAAAAATTACTGCAGTTGACGGA
AAAGAGGAAAGAAATGATCGACAAGTGGGAAGACCGATGGGAATGGTTAAGACTGATTCTGGAGGTCCATCAGTTCTCAAGAGA
CGCCAGTGTGGCCGAGGCCTGGCTGCTTGGACAGGAGCCGTACCTATCCAGCCGAGAGATAGGCCAGAGCGTGGACGAGGTGGA 
GMGCTCATCAAGCGCCACGAGGCATTTGAAAAGTCTGCAGCAACCTGGGATGAGAGGTTCTCTGCCCTGGAAAGGCTGACTAC 
ATTGGAGTTACTGGAAGTGCGCAGACAGCAAGAGGAAGAGGAGAGGAAGAGGCGGCCGCCTTCTCCCGAGCCGAGCACGAAGGT 
TTCAGAGGAAGCCGAGTCCCAGCAGCAGTGGGATACTTCAAAAGGAGAACAAGTTTCCCAAAACGGTTTGCCAGCTGAACAGGG
ATCTCCACGGATGGCAGAAACGGTGGACACAAGCGAAATGGTCAACGGCGCTACAGAACAAAGGACGAGCTCTAAAGAGTCCAG 
CCCCATCCCCTCCCCGACCTCTGATCGTAAAGCCAAGACTGCCCTCCCAGCCCAGAGTGCCGCCACCTTACCAGCCAGAACCCA 
GGAGACACCTTCGGCCCAGATGGAAGGCTTCCTCAATCGGAAACACGAGTGGGAGGCCCACAATAAGAAAGCCTCAAGCAGGTC 
CTGGCACAATGTTTATTGTGTCATAAATAACCAAGAAATGGGTTTCTACAAAGATGCAAAGACTGCTGCTTCTGGAATTCCCTA 

CCACAGCGAGGTCCCTGTGAGTTTGAAAGAAGCTGTCTGCGAAGTGGCCCTTGATTACAAAAAGAAGAAACACGTATTCAAGCT
AAGACTAAATGATGGCAATGAGTACCTCTTCCAAGCCAAAGACGATGAGGAAATGAACACATGGATCCAGGCTATCTCTTCCGC 
CATCTCCTCTGATAAACACGAGGTGTCTGCCAGCACCCAGAGCACGCCAGCATCCAGCCGCGCGCAGACCCTCCGCACCAGCGT 
CGTCACCATCACCAGCGAGTCCAGTCCCGGCAAGCGGGAAAAGGACAAAGAGAAAGACAAAGAGAAGCGGTTCAGCCTTTTTGG 
CAAAAAGAAATGAACTCCTTTCCTTCACCTCCTGCCCTTCTCTTACCTTTTCAGTGAAATTCCAGCATGCAAGCTCAGAACCAA 

CACATTACTCTCTGTGCCTAATGTTCCTCAATGTGGTTGATTTATTTTTTTTTTTAATTTATAGAGCATTTCGGGGGGGGTGGG
G 

17 

MQPPLDLKQILPFPLEPAPTLGLFSNYSTMDPVQKAVLSHTFGGPLLKTKRPVISCNICQIRFNSQSQAEAHYKGNRHARRVKG 
IEAAKTRGREPGVREPGDPAPPGSTPTNGDGVAPRPVSMENGL.GPAPGSPEKQPGSPSPPSIPETGQGVTKGEGGTPAPASLPG 
GSKEEEEKAKRLLYCALCKVAVNSLSQLEAHNKGT^

EVQLKQHISSRRHRDGVAGKPNPLIiSRHKKSRGAGELAGTLTFSKELPKSLAGGLLPSPLAVAAVMAAMGSPLSLRP 
LQGPPITHPLLHPAPGPIRTAHGPILFSPY 

18 

AGAGCGACGCGGAACCCCGGGCGCCTGGGTCCCCAGCATGATCCTCGGCAGCCTGAGCCGGGCAGGGCCCCTGCCTCTGCTACG 

GCAGCCCCCGATCATGCAGCCCCCACTGGACCTCAAGCAGATCCTGCCCTTCCCACTCGAGCCAGCCCCTACCCTTGGCCTCTT
CAGCAACTACAGCACCATGGACCCTGTGCAGAAGGCTGTGCTCTCCCACACTTTTGGGGGACCCTTGCTCAAGACCAAGCGACC 
CGTCATTTCCTGTAATATCTGTCAAATCCGCTTCAATTCTCAGAGCCAGGCTGAGGCGCACTACAAAGGTAATCGCCACGCCCG 
ACGAGTCAAAGGCATTGAGGCTGCCAAGACCAGAGGCAGGGAGCCTGGCGTCCGAGAACCTGGAGACCCAGCTCCCCCAGGCAG 
CACCCCAACAAATGGGGATGGTGTAGCACCCCGTCCAGGTACTAAGCACAAGACAATTCTGGAGGCCCGAAGTGGGCTCGGGCC 

CATCAAAGCTTACCCTCGGCTGGGGCCTCCCACCCCGGGGGAACCAGAGGCTCCTGCCCAGGACCGAACTTTCCACTGTGAGAT
CTGCAATGTCAAGGTCAACTCGGAGGTCCAACTGAAACAGCACATCTCCAGCCGGCGGCACCGAGACGGCGTGGCCGGGAAGCC 
CAACCCACTACTGAGCCGTCACAAGAAGTCTAGGGGCGCCGGGGAGCTGGCGGGCACGCTGACTTTCTCCAAGGAGCTGCCCAA 
GTCCCTGGCGGGCGGCCTGCTCCCCAGCCCCCTGGCGGTGGCTGCAGTGATGGCAGCGGCAGCAGGCTCGCCGCTGTCCCTGCG 
CCCGGCTCCAGCCGCACCTCTTCTCCAGGGACCGCCGATCACTCACCCTCTGCTTCACCCGGCCCCCGGACCCATCCGAACTGC 

GCACGGACCCATCCTCTTCTCCCCGTACTGACCTCAACCCTGAACCCCTCCCATTCAACTCCCCACCTCCAGCCGGGACCCAGG
CGTCCGGACTCCCAGCCCGCCCCTCCTCCCGGCACTCCCTAAATGATCTCTCCTTCCCCCCCCCCACCCCGAGATACGGGGTTC 
CAGGAAAGGGGAGGGGTAGCGGGGGAGGGGGGCTTCAGAAGGGGGGGAACACCCCAGATCTCAGGGAACCCCGCCCCCTGCCTT 
TCCCTCTCCCCTAGAAAAGGGGGGGCCGTCTCACCCCCGAGCCCCCTTGGAGACACCCCCCCTCCCAAAAGCCATGTCCATCCA 
GCCCTTCCCCCCAAACCTAGCACAAAACGGGGTTCACAAGCCATGGTCGGGGTCCGGGGGGGACAGAAATGGATTTTCTTGGCA 

ATAAGCGGACTCTGGGACTCCGGCTCCCTACCCCAAACTGAAGCGCTTCCGTGAACACCCCCGTCCTCCGTAGGGGGAGGGGAG
CAGGCGGGATCCTGGGTCCCTCATAAGCACTTTGGTTTTACCGCCTGCAACCTCACTGTGCCCGCCCCGCACCATGCCCTAGCC 
CCAGGTCTAGCCGGGCCCATTGCAGGGGGCAGCACTTGGGGGCATCTCCGGCACTTGGGTGGGACCAAGGAGATGCCACCATAG 
ACCTTTCCCTCGCCTTCTTCCTCCCTAGTCCGGGTTC.CATTCTTTTCACCAGCACCCATCGCCCAAGGGGTACCGAGGGGGGCA 
AGGGGTGTCCAGTCCAAGCCCACCCCCGCCTCGCCTTCCGCAAAACTGTGAGCAAAAAGCAATAGAAGCCTCGCCCCCGCCCTT 

CCCCTTCGCAGGATTCGCCGAGTCTGTAGCCTCCCCGATTCAAGTTCCTAGACCTCATGGCTGTCCCCTCCCACCAGTCACCTC
CACTGCACAACTCGGGCGGGGGTGTGACACCTCTCCCCCACCCCCGACTCCGTGGTTTCCGTATCGTCAACCCTTCAGCCGCCG 
ACCCGGGAGGGGTCTGGCCTACACTGGTCTTCCCCTTCCCATCAACTCTTTCTGCTTGACAATGTAGCAACCCAGGCCCCCCAC 
CCACGGCCCTCCCCTTTTTCCTCTCCCTGACAATAAAGTCTGAATTTGTTCTGCCCTCCG 

19
MYKSLLDKAGLGSITSVRFLGDQQSVFVSKDLLKPIQDVNSLRLSLTDNQTVSKEFQALIVKHLDESHLLQGDKNLVGSEVRIY
SLDPSTQWFSATWHGNPSSKTLQVNCEEIPALKIVDPALIHVEWHDNFVTCGNSTRIGAVKRKSSENNGSSVSKQAKSCSEV 
SPSMCPVQSVPTTVCKEILLGCTAATPSSNRQQNTPQAANSPPNIGAKLPQGCHKQSLPEEISSCLNTKSEVLRTKPDVCKAGL 
LSSKSSQVGAGDLKILSEPKGSCIQPKTNTDQESRLESTPQPVTGLTKECLVTKTSSKAELDNATAPELQKRLEHTASTPDGLS 
DKPEVEAGVTRLNSCSEKKVGPSDLGSQSQNLKETSVKVDHDSCCTRSSNKTQTPPARKSVLTDPDKLKKLQQSGEAFVQDDSC
VNIVAQLPKCRECRLDSLRKDKDQQKDSPVFCRFFHFRRLQFNKHGVLRVEGFLTPNKYDSEAIGLWLPLTKNWGTDLDTAKY 
ILMIGDHFCQWISEKEAMSTIEPHRQVAWKRAVKGVREMCDVCDTTIFNLHWVCPRCGFGVCVDCYRLKRKNCQQGAAYKTF 
SWIRCVKSQIHEPENLMPTQIIPGKALYDVGDIVHSVRAKWGIKANCPCSNRQFKLFSKPALKEDLKQASLSGEKPSLG 
SSPVLEPMVCGEAPSKPASNVKPICPMTSPL3MLADLTSGNW 
ILTPVSftMSGFLRNLLNSS

PMKSNWWFRECWKQGQPVMVSGVHHKLNSELWKPESFRKEFGEQEVDLWCRra 

MVLKLKDWPPGEDFRDMMPSRFDDL1MNIPLPEYTRRDGKLNLASRLPNYFVRPDLGPKMYNAYGLITPEDRKYGTTNLHLDVS 
DAA3WMVWGIPKGQCEQEEEVLKTIQDGDSDELTIKRFIEGKEKPGALWHIYAAKDTEKIREFLKKVSEEQGQENPADH 
DQSWYLDRSLRKRLHQEYGVQGWAIVQFLGDWFIPAGAPHQVHNLYSCIKVAEDFVSPEHVKHCFWLTQEFRYIiSQTHTNHED 
KLQVKWI YHAVKDAVAMLKAWPVLANLNLPAHWK

20 

GGAGAGCATTTTTAGTAGAACATAATTTGGTTTTAGCTGAACGAAAGTCACCTGAAATTTCTGAACGAATTGTACAGTGGCCTG 
CAATAACGTACAAACCTCTGTTGGACAAAGCTGGTTTGGGATCCATAACTTCTGTTCGCTTTCTGGGAGATCAACAAAGAGTAT 
TTCTTTCTAAAGACCTTTTGAAGCCTATACAGGATGTAAACAGTCTTCGACTTTCTCTTACGGATAATCAGATTGTCAGTAAAG 

AATTTCAAGCTTTGATTGTGAAGCATTTAGATGAAAGCCATCTTTTAAAAGGTGACAAAAACTTAGTTGGTTCAGAAGTAAAAA
TTTATAGCTTGGACCCATCTACTCAGTGGTTTTCAGCAACCGTTGTAAATGGAAACCCAGCATCAAAAACTCTTCAAGTCAACT 
GTGAGGAGATTCCAGCACTGAAAATTGTTGATCCGTCACTGATTCATGTTGAAGTTGTACACGATAACCTTGTGACATGTGGTA 
ATTCTGCAAGAATTGGAGCTGTAAAACGCAAGTCTTCTGAGAATAATGGAACCCTGGTTTCCAAACAAGCAAAATCTTGCTCTG 
AGGCCTCTCCCAGTATGTGTCCTGTGCAGTCTGTACCTACAACAGTTTTTAAGGAGATACTGCTTGGCTGTACTGCAGCAACTC 

CACCTAGTAAGGACCCAAGACAGCAAAGTACTCCCCAGGCTGCCAACTCTCCACCTAACCTTGGAGCAAAAATTCCTCAAGGAT
GTCATAAACAAAGTTTACCAGAGGAAATTTCTTCCTGTCTAAATACAAAGTCTGAAGCTCTGAGAACAAAACCAGATGTCTGCA 
AAGCAGGGTTGCTCTCAAAGTCCTCTCAGATTGGAACTGGAGACTTGAAAATTCTGACTGAGCCAAAAGGCAGCTGTACTCAGC 
CTAAGACAAACACTGATCAGGAAAACAGATTGGAGTCTGTTCCACAAGCATTGACTGGCCTTCCTAAGGAGTGCTTACCTACAA 
AGGCTTCTTCTAAGGCAGAATTGGAAATTGCCAATCCTCCTGAACTGCAGAAGCACCTAGAACATGCACCTTCCCCATCGGATG 

TTTCAAATGCACCAGAAGTGAAAGCAGGTGTCAATAGTGATAGCCCTAATAACTGTTCAGGAAAAAAGGTAGAACCTTCAGCTT
TAGCTTGCCGATCACAGAATTTAAAGGAATCTTCAGTAAAAGTAGATAATGAAAGCTGTTGTTCAAGAAGCAACAATAAAATCC 
AGAATGCCCCATCCAGGAAGTCGGTTTTGACAGACCCAGCTAAACTCAAAAAGCTGCAACAGAGTGGCGAGGCCTTCGTACAGG 
ATGATTCTTGTGTGAACATCGTGGCACAGTTGCCTAAATGCCGAGAGTGTCGCTTGGACAGTCTCCGCAAGGATAAGGAGCAAC 
AGAAGGACTCACCTGTGTTTTGCCGCTTCTTTCACTTCAGGAGGTTACAATTCAACAAACATGGTGTGTTGCGGGTAGAAGGCT 

TCTTAACACCAAACAAGTATGACAATGAAGCAATTGGCTTGTGGTTACCTTTAACCAAAAACGTTGTGGGGATTGATTTGGACA
CAGCAAAGTACATCTTGGCCAACATTGGAGACCACTTCTGTCAAATGGTGATTTCTGAAAAGGAAGCTATGTCAACTATTGAGC 
CACACAGACAGGTTGCTTGGAAGCGAGCTGTCAAAGGTGTTCGAGAAATGTGTGATGTGTGCGACACCACCATCTTCAACCTGC 
ACTGGGTGTGTCCTCGGTGTGGGTTTGGAGTATGTGTGGACTGCTACCGGATGAAGAGAAAGAATTGCCAACAGGGTGCTGCTT 
ACAAGACTTTCTCTTGGCTAAAATGTGTGAAGAGTCAGATACATGAACCAGAGAACTTAATGCCCACACAGATCATTCCTGGAA 

AAGCACTCTATGATGTTGGAGACATTGTTCATTCTGTAAGAGCGAAATGGGGAATAAAGGCAAACTGCCCTTGTTCAAACAGGC
AATTCAAACTCTTTTCAAAGCCAGCCTCAAAGGAAGACCTAAAACAGACTTCTTTAGCTGGAGAAAAACCGACTCTTGGTGCAG 
TGCTCCAGCAGAATCCCTCAGTGTTGGAGCCAGCAGCTGTGGGTGGGGAAGCAGCCTCCAAGCCAGCCGGCAGCATGAAGCCTG 
CCTGTCCAGCCAGCACATCTCCTCTAAACTGGCTGGCCGACCTAACCAGCGGGAATGTCAACAAGGAAAACAAGGAAAAACAAC 
CAACAATGCCAATTTTAAAGAATGAAATCAAATGCCTTCCACCCCTCCCACCTTTAAGCAAATCCAGCACAGTCCTCCATACGT 

TTAACAGCACAATTTTGACACCCGTAAGCAACAACAATTCTGGTTTCCTCCGGAATCTCTTGAATTCTTCTACAGGAAAGACAG
AAAATGGACTCAAGAATACACCAAAAATCCTTGATGACATCTTTGCCTCTTTGGTGCAAAATAAGACGACTTCTGATTTATCTA 
AGAGGCCTCAAGGACTAACCATCAAGCCCAGCATTCTGGGCTTTGACACTCCTCACTATTGGCTTTGTGATAATCGCTTGCTGT 
GCTTGCAAGACCCCAACAATAAGAGCAACTGGAATGTGTTTAGGGAGTGCTGGAAACAAGGGCAGCCAGTGATGGTGTCTGGAG 
TGCATCATAAATTGAACTCTGAACTTTGGAAACCTGAATCCTTCAGGAAAGAGTTTGGTGAGCAGGAAGTAGACCTAGTTAATT 

GTAGGACCAATGAAATCATCACAGGAGCCACAGTAGGAGACTTCTGGGATGGATTTGAAGATGTTCCAAATCGTTTGAAAAATG
AAAAAGAACCAATGGTGTTGAAACTTAAGGACTGGCCACCAGGAGAAGATTTTAGAGATATGATGCCTTCCAGGTTTGATGATC
TGATGGCCAACATTCCACTGCCCGAGTACAGAAGGCGAGATGGCAAACTGAATTTGGCCTCTAGGCTGCCAAACTACTTTGTTC 
GGCCAGATCTGGGCCCCAAGATGTATAATGCTTATGGATTAATCACTCCTGAAGATCGGAAATATGGAACAACAAATCTTCACT 
TAGATGTATCTGATGCAGCTAATGTCATGGTCTATGTGGGAATTCCCAAAGGACAGTGTGAGCAAGAAGAAGAAGTCCTTAAGA 
CCATCCAAGATGGAGATTCTGACGAACTCACAATAAAGCGATTTATTGAAGGAAAAGAGAAGCCAGGAGCACTGTGGCACATAT
ATGCTGCAAAGGACACGGAGAAGATAAGGGAATTTCTTAAAAAGGTATCAGAAGAGCAAGGTCAAGAAAACCCAGCAGACCACG 
ATCCTATTCATGATCAAAGCTGGTATTTAGACCGATCATTAAGAAAACGTCTTCATCAAGAGTATGGAGTTCAAGGCTGGGCTA 
TTGTACAGTTTCTTGGGGATGTGGTGTTTATCCCGGCAGGAGCTCCACATCAGGTTCATAACTTATATAGCTGCATCAAAGTGG 
CTGAAGATTTTGTTTCTCCAGAGCATGTTAAACACTGCTTCTGGCTTACTCAGGAATTCCGATATCTGTCACAGACTCATACCA 

ATCACGAAGATAAATTACAGGTGAAGAATGTTATCTACCATGCAGTGAAAGATGGAGTTGCTATGCTGAAAGCCAGTGAATCCA
GTTTTGGCAAACCTTAATCTCCCTGCACATTGGAAATGAATTACAGGCAGCTGTTCAAACTCTTCAGGCAGGATTCCTGTGGAC 
TTTGAGATTCATGTTACCTCATCTTCTTTTTTAAACTGTACCCAACTTGTGAGGGTACTCTGTCTAATGTATATTTCTAGTGTT 
TACAGACAGTAAATGTGTATATGTAGTAACTATTTACAGAACATGCATCCTTAAACTGTGACTTCTCACCTAGTGCAGAACTTT 
TACCAGGCTGTAAAAGCAAAACCTCGTATCAGCTCTGGAACAATACCTGCAGTTATTCTTCAGCTGTTTGGACAACTTAGATTG 

GGTTTATAACTATTAGGAATCACTGCACAGTTTATTTGGGTTGTGTTTTGTGTCTGAGTCCCCTCCCTCATCCCTTAGGGTCCA
GAAGAGCAATGGAGGAAGTGACAGCTAATGTTGCAGTTCTTATTGTATGGCATAGGACTGGCATTATATAGCAGAAATCAACTA 
CTGTACAATTTCTTGGGGTTAACCATCTTTAGTTAAATGGAATTTTAATTTAAATGACGCTTTGCTAATTTTAAGTGTTAAGCA 
TTTTGCATTAAAATATTCATATAAT 

21 

MKAGGGMAGSVSGHSESLASLSRSPQTKAGQKLQHKNYPVCVLYIFITKFNKQKPLLNQVILSSVKRN
22 

CAGAAGACTACATTAGTGAGATGTAAGAATTATTAAATATTCCATTTCCGCTTTGGCTACAATTATGAAGAAGTTGAAGGTACT 
TCTTTTAGACCACCAGTAAATAATCCTCCTTCAAAAAATAAAAATAAAAGAAAAAGGAAAATCATTCAGGAAGAAATGACCTGT 
CTAAAAAAACCTAAGGAAGAATAATAATATAAGAAAGGAAATTTAAAAACATTCCACAAGAAGAAAAATTATTGTTTATACTCC 

TACTTATGGTTATATCTTATATTCTCTATTCAAGTGACCTGTCTTTTAAAAAGGCAGTGCTGTCTTACCTCTTGCTAGTGGGTT
AAATGTTTTCAAAAATTATAGCAGTAGTAGAAGTTTTGTATAAAATTTGTCCTTATTTGTTAATTGTATATAAATGTTAATTAT 
TTGATACGAATGTTATGCATTTAGTATGCACATTGAAGTCTAAACTGTAGAAGAGTCTAAAACAAGTTCTCTTTTTGCAGATTC 
ACATACTAATGGTTTAATTCTGTGCTCTGTTTAAAGTACTATTATAACTAGAGTAGATCTGAATGAGGATAACCCTAAAATCAT 
GAGGAATGGAAGAATGGACCTTGAAACTACCTAGGCTTTTATGCATGGCACCTCTTTATAATGAAGACACTTTTTAAAGTTTTT 

GTTTTTGTTTCAATTACCGCTAGATTTTTTTTTCTCTTTTTTTAAAATCCATTTTACTGGAAAGTTGGCCAGCAGAGGGAGTAG
AAATTATTAAAATTCTAGTGTTTGGATTGGGCCCTTCTCTAACAGTACATACTCATTCCCAAAGCAATCCAAAAACAAAATGTG 
AACCATTTGGGTTTCAAATGTTAAGAACACTAAATAGCATGATTTAAAAAATGAAAAATGCTAACACCCAAGAAAAGAAGATAT 
TAAGTGCCTTTTAACAACTCCTAGAGTACAAAATGAGTACATCATAATGCTGGCTCTTCTACTAATGAACCATCGAGTGATATT 
GAATAAATTATTTATCTTCTCAGTTTCCTTATCTGTAAATTACAATATTAGACTAAGTAAGTTTTTCCAACTCTTCACTACCAA 

TTACCTTAGGCTTTTATAATGCTCCGCCTACTTCAGTCCCATGTTTCAGAAGCTTTTGTCTATTTTTTAAACTCATTGATTAAA
TAATGATTAATGCATTCTCCACATTTTAATATTGCAAAGGCCCATTGGAGTTTCTGAAGTGGCTCCACAGAATTGAAATAATTT 
CAAATAACTGTAAAGGAACTGAAAATCTTCACAGAGATGAAGTGGGGTTTCCATTAGGTGCTTTGAAATTTGATAACAAATCAT 
CAACTTCCACTGGTCAATATATAGATTTTGGGTGTCTGAGGCCCCAAGATTAGATGCCACTAATCTCCAAAGATTCCCTCCAAT 
TATGAAATATTTTAATGTCTACTTTTAGAGAGCACTAGCCAGTATATGACCATGTGATTAATTTCTTTTCACACTAGATAAAAT 

TACCTGGTTCAAAAGTGGTTTTTGTTTATTAAATTTGGTMTAAATATATATAATACACAGACAGGATAGTTTTTATGCTGAAG
TTTTTGGCCAGCTTTAGTTTGAGGACTCCTTGATAAGCTTGCTAAACTTTCAGAGTGCCCTGAGACACTTCCAGCCATCCCTCC 
TCCTGCCTTCATTGGGGCAGACTTGCATTGCAGTCTGACAGTAATTTTTTTTCTGATTGAGAATTATGTAAATTCAATACAATG 
TCAGTTTTTAAAAGTCAAAGTTAGATCAAGAGAATATTTCAGAGTTTTGGTTTACACATCAAGAAACAGACACACATACCTAGG 
AMGATTTACACAATAGATAATCATCTTAATGTGAAAGAT^ 

AGTCTTCTGTATGGMTTTTGTCACTTAAGATGAGCTGCAAATAAATAATACCTTCAATGGAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAA 

23 



MGNFRGHALPGTFFFIIGLWWCTKSILKYICKKQKRTCYLGSKTLFYRLEILEGITIVGMALTGMAGEQFIPGGPHLMLYDYKQ 
GHMQLLGWHHFTMYFFFGLLGVADILCFTISSLPVSLTKLMLSNALFVEAFIFYNHTHGREMLDIFVHQLLVLWFLTGLVAF
LEFLVR1MVLLELLRSSLILLQGSWFFQIGFVLYPPSGGPAWDLMDHENILFLTICFCWH
RLCSSEVGLLKNAEREQESEEEM 

24 

GAAGCACATCTGGACAGCTGTGCGGCCTCCTTGCGGGCCGACGTCAGCCGAGCACGTCCCCCACGTCCTCTCCTTCTCGCCACT 
TATTATTTATTCGTTTTCCCAAAGAAGCGACTAGGGACCCAAGTTTAAAAATTCCTCCCCGCACTCAATGCGAGACGTGGCCAG
ATCCCATCCAACACACGGTTTAATTTTCATGGGGCTCTGGGATCAAAAGMCAGAAACAGCAACAACAAAAGCCCAGCCGCTGT 
CTGATTTTAAGCTGGCAAAGTGGGAAAAATAAAGTGTTGAGTAAACAGACCAAGTTGGATCATGGGGAATTTCAGAGGTCATGC 
CCTCCCTGGAACCTTCTTTTTTATTATTGGTCTTTGGTGGTGTACAAAGAGTATTCTGAAGTATATCTGCAAAAAGCAAAAGCG 
AACCTGCTATCTTGGTTCCAAAACATTATTCTATCGATTGGAAATTTTGGAGGGAATTACAATAGTTGGCATGGCTTTAACTGG 

CATGGCTGGGGAGCAGTTTATTCCTGGAGGGCCCCATCTGATGTTATATGACTATAAACAAGGTCACTGGAATCAACTCCTGGG
CTGGCATCATTTCACCATGTATTTCTTCTTTGGGCTGTTGGGTGTGGCAGATATCTTATGTTTCACCATCAGTTCACTTCCTGT 
GTCCTTAACCAAGTTAATGTTGTCAAATGCCTTATTTGTGGAGGCCTTTATCTTCTACAACCACACTCATGGCCGGGAAATGCT 
GGACATCTTTGTGCACCAGCTGCTGGTTTTGGTCGTCTTTCTGACAGGCCTCGTTGCCTTCCTAGAGTTCCTTGTTCGGAACAA 
TGTACTTCTGGAGCTATTGCGGTCAAGTCTCATTCTGCTTCAGGGGAGCTGGTTCTTTCAGATTGGATTTGTCCTGTATCCCCC 

CAGTGGAGGTCCTGCATGGGATCTGATGGATCATGAAAATATTTTGTTTCTCACCATATGCTTTTGTTGGCATTATGCAGTAAC
CATTGTCATCGTTGGAATGAATTATGCTTTCATTACCTGGTTGGTTAAATCTAGACTTAAGAGGCTCTGCTCCTCAGAAGTTGG 
ACTTCTGAAAAATGCTGAACGAGAACAAGAATCAGAAGAAGAAATGTGACTTTGATGAGCTTCCAGTTTTTCTAGATAAACCTT 
TTCTTTTTTACATTGTTCTTGGTTTTGTTTCTCGATCTTTTGTTTGGAGAACAGCTGGCTAAGGATGACTCTAAGTGTACTGTT 
TGCATTTCCAATTTGGTTAAAGTATTTGAATTTAAATATTTTCTTTTTAGCTTTGAAAATATTTTGGGTGATACTTTCATTTTG 

CACATCATGCACATCATGGTATTCAGGGGCTAGAGTGATTTTTTTCCAGATTATCTAAAGTTGGATGCCCACACTATGAAAGAA
ATATTTGTTTTATTTGCCTTATAGATATGCTCAAGGTTACTGGGCTTGCTACTATTTGTAACTCCTTGACCATGGAATTATACT 
TGTTTATCTTGTTGCTGCAATGAGAAATAAATGAATGTATGTATTTTGGTGC 

25 

MPSLWDRFSSSSTSSSPSSLPRTPTPDRPPRSAWGSATREEGFDRSTSLESSDCESLDSSNSGFGPEEDTAYLDGVSLPDFELL 
SDPEDEHLCANLMQLLQESLAQARLGSRRPARLLMPSQLVSQVGKELLRLAYSEPCGLRGALLDVCVEQGKSCHSVGQLALDPS
LVPTFQLTLVLRLDSRLWPKIQGLFSSANSPFLPGFSQSLTLSTGFRVIKKKLYSSEQLLIEEC 

26 

GCAGCAGGCCAAGGGGGAGGTGCGAGCGTGGACCTGGGACGGGTCTGGGCGGCTCTCGGTGGTTGGCACGGGTTCGCACACCCA 
TTCAAGCGGCAGGACGCACTTGTCTTAGCAGTTCTCGCTGACCGCGCTAGCTGCGGCTTCTACGCTCCGGCACTCTGAGTTCAT 

CAGCAAACGCCCTGGCGTCTGTCCTCACCATGCGTAGCCTTTGGGACCGCTTCTCGTCGTCGTCCACCTCCTCTTCGCCCTCGT
CCTTGCCCCGAACTCCCACCCCAGATCGGCCGCCGGGCTCAGCCTGGGGGTCGGCGACCCGGGAGGAGGGGTTTGACCGCTCCA 
CGAGCCTGGAGAGCTCGGACTGCGAGTCCCTGGACAGCAGCAACAGTGGCTTCGGGCCGGAGGAAGACACGGCTTACCTGGATG 
GGGTGTCGTTGCCCGACTTCGAGCTGCTCAGTGACCCTGAGGATGAACACTTGTGTGCCAACCTGATGCAGCTGCTGCAGGAGA 
GCCTGGCCCAGGCGCGGCTGGGCTCTCGACGCCCTGCGCGCCTGCTGATGCCTAGCCAGTTGGTAAGCCAGGTGGGCAAAGAAC 

TACTGCGCCTGGCCTACAGCGAGCCGTGCGGCCTGCGGGGGGCGCTGCTGGACGTCTGCGTGGAGCAGGGCAAGAGCTGCCACA
GCGTGGGCCAGCTGGCACTCGACCCCAGCCTGGTGCCCACCTTCCAGCTGACCCTCGTGCTGCGCCTGGACTCACGACTCTGGC 
CCAAGATCCAGGGGCTGTTTAGCTCCGCCAACTCTCCCTTCCTCCCTGGCTTCAGCCAGTGCCTGACGCTGAGCACTGGCTTCC 
GAGTCATCAAGAAGAAGCTGTACAGCTCGGAACAGCTGCTCATTGAGGAGTGTTGAACTTCAACCTGAGGGGGCCGACAGTGCC 
CTCCAAGACAGAGACGACTGAACTTTTGGGGTGGAGACTAGAGGCAGGAGCTGAGGGACTGATTCCAGTGGTTGGAAAACTGAG 

GCAGCCACCTAAGGTGGAGGTGGGGGAATAGTGTTTCCCAGGAAGCTCATTGAGTTGTGTGCGGGTGGCTGTGCATTGGGGACA
CATACCCCTCAGTACTGTAGCATGGAACAAAGGCTTAGGGGCCAACAAGGCTTCCAGCTGGATGTGTGTGTAGCATGTACCTTA 
TTATTTTTGTTACTGACAGTTAACAGTGGTGTGACATCCAGAGAGCAGCTGGGCTGCTCCCGCCCCAGCCTGGCCCAGGGTGAA 
GGAAGAGGCACGTGCTCCTCAGAGCAGCCGGAGGGAGGGGGGAGGTCGGAGGTCGTGGAGGTGGTTTGTGTATCTTACTGGTCT 
GAAGGGACCAAGTGTGTTTGTTGTTTGTTTTGTATCTTGTTTTTCTGATCGGAGCATCACTACTGACCTGTTGTAGGCAGCTAT 

CTTACAGACGCATGAATGTAAGAGTAGGAAGGGGTGGGTGTCAGGGATCACTTGGGATCTTTGACACTTGAAAAATTACACCTG
GCAGCTGCGTTTAAGCCTTCCCCCATCGTGTACTGCAGAGTTGAGCTGGCAGGGGAGGGGCTGAGAGGGTGGGGGCTGGAACCC 
CTCCCCGGGAGGAGTGCCATCTGGGTCTTCCATCTAGAACTGTTTACATGAAGATAAGATACTCACTGTTCATGAATACACTTG 
ATGTTCAAGTATTAAGACCTATGCAATATTTTTTACTTTTCTMTAAACATGTTTGTTAAAACAAAAAAAAAAAAAAAAA 
GGPGGPECPMRTPQLALLQVFFLVFPDGVRPQPSSSPSGAVPTSLELQRGTDGGTLQSPSEATATRPAVPGLPTWPTLVTPSA
PGNRTVDLFPVLPICVCDLTPGACDINCCCDRDCYLLHPRTVFSFCLPGSVRSSSWVCVDNSVIFRSMSPFPSRVFMDSNGIRQ 
FCVHVNNSNLNYFQKLQKVNATNFQALAAEFGGESFTSTFQTQSPPSFYRAGDPILTYFPKWSVISLLRQPAGVGAGGLCAESN
PAGFLESKSTTCTRFFKNLASSCTLDSALNAASYYNFTVLKVPRSMTDPQMEFQVPVILTSQANAPLLAGNTCQNVVSQVTYE 
IETNGTFGIQKVSVSLGQTNLTVEPGASLQQHFILRFRAFQQSTAASLTSPRSGNPGYIVGKPLLALTDDISYSMTLLQSQGNG
SCSVKRHEVQFGVNAISGCKLRLKKADCSHLQQEIYQTLHGRPRPEWAIFGNADPAQKGGWTRILNRHCSISAINCTSCCLIP 
VSLEIQVLWAWGLLSNPQAQVSGVRFLYQCQSIQDSQQVTEVSLTTLWFVDITQKPQPPRGQPKMDWKWPFDFFPFKVAFSR 
GVFSQKCSVSPILILCLLLLGVLNLETM 

28 

GGGGGGACCCGGAGGCCCTGAATGCCCCATGCGCACCCCACAGCTCGCGCTCCTGCAAGTGTTCTTTCTGGTGTTCCCCGATGG
CGTCCGGCCTCAGCCCTCTTCCTCCCCATCAGGGGCAGTGCCCACGTCTTTGGAGCTGCAGCGAGGGACGGATGGCGGAACCCT 
CCAGTCCCCTTCAGAGGCGACTGCAACTCGCCCGGCCGTGCCTGGACTCCCTACAGTGGTCCCTACTCTCGTGACTCCCTCGGC 
CCCTGGGAATAGGACTGTGGACCTCTTCCCAGTCTTACCGATCTGTGTCTGTGACTTGACTCCTGGAGCCTGCGATATAAATTG 
CTGCTGCGACAGGGACTGCTATCTTCTCCATCCGAGGACAGTTTTCTCCTTCTGCCTTCCAGGCAGCGTAAGGTCTTCAAGCTG 

GGTTTGTGTAGACAACTCTGTTATCTTCAGGAGTAATTCCCCGTTTCCTTCAAGAGTTTTCATGGATTCTAATGGAATCAGGCA
GTTTTGTGTCCATGTGAACAACTCAAACTTAAACTATTTCCAGAAGCTTCAAAAGGTCAATGCAACCAACTTCCAGGCCCTGGC 
TGCAGAGTTTGGAGGCGAATCATTCACTTCAACATTCCAAACTCAATCACCACCATCTTTTTACAGGGCTGGGGACCCCATTCT 
TACTTACTTCCCCAAGTGGTCTGTAATAAGCTTGCTGAGACAACCTGCAGGAGTTGGAGCTGGGGGACTCTGTGCTGAAAGCAA 
TCCTGCAGGTTTCCTAGAGAGTAAAAGTACAACTTGCACTCGTTTTTTCAAGAACCTGGCTAGTAGCTGTACCTTGGATTCAGC 

CCTCAATGCTGCCTCTTACTATAACTTCACAGTCTTAAAGGTTCCAAGAAGCATGACTGATCCACAGAATATGGAGTTCCAGGT
TCCTGTAATACTTACCTCACAGGCTAATGCTCCTCTGTTGGCTGGAAACACTTGTCAGAATGTAGTTTCTCAGGTCACCTATGA 
GATAGAGACCAATGGGACTTTTGGAATCCAGAAAGTTTCTGTCAGTTTGGGACAAACCAACCTGACTGTTGAGCCAGGCGCTTC 
CTTACAGCAACACTTCATCCTTCGCTTCAGGGCTTTTCAACAGAGCACAGCTGCTTCTCTCACCAGTCCTAGAAGTGGGAATCC 
TGGCTATATAGTTGGGAAGCCACTCTTGGCTCTGACTGATGATATAAGTTACTCAATGACCCTCTTACAGAGCCAGGGTAATGG 

AAGTTGCTCTGTTAAAAGACATGAAGTGCAGTTTGGAGTGAATGCAATATCTGGATGCAAGCTCAGGTTGAAGAAGGCAGACTG
CAGCCACTTGCAGCAGGAGATTTATCAGACTCTTCATGGAAGGCCCAGACCAGAGTATGTTGCCATCTTTGGTAATGCTGACCC 
AGCCCAGAAAGGAGGGTGGACCAGGATCCTCAACAGGCACTGCAGCATTTCAGCTATAAACTGTACTTCCTGCTGTCTCATACC 
AGTTTCCCTGGAGATCCAGGTATTGTGGGCATATGTAGGTCTCCTGTCCAACCCGCAAGCTCAAGTATCAGGAGTTCGATTCCT 
ATACCAGTGCCAGTCTATACAGGATTCTCAGCAAGTTACAGAAGTATCTTTGACAACTCTTGTGAACTTTGTGGACATTACCCA 

GAAGCCACAGCCTCCAAGGGGCCAACCCAAAATGGACTGGAAATGGCCATTCGACTTCTTTCCCTTCAAAGTGGCATTCAGCAG
AGGAGTATTCTCTCAAAAATGCTCAGTCTCTCCCATCCTTATCCTGTGCCTCTTACTACTTGGAGTTCTCAACCTAGAGACTAT 
GTGAAGAAAAGAAAATAATCAGATTTCAGTTTTCCCTATGAGAAACTCTGAGGCAGCCACTTATCTTGGCTAAATAGAACCTCA 
CCTGCTCATGACCAGAGAGCATTTAGGATAATAGAGGACCTAACTGAAGGAATCCTTGTATATGAAAGGAGTTATTTTAGAAAA 
GCAATAAAAATATTTTATTCATCATAGCTCTCTGCTTTGGGCTCTGCAGGCCACCAGATACACATGAGGCCCCTACTTCTCAAG 

CTGGGAAGGCCAAGAGCCTTCCTTCAGCCTTTCTGGTTATGTTACACCTAGCTGAATGTTTACAAGGTCTGGATCCATCAGCCC
TCAGGCACAGTTGGGCCAAGCAGAAAGAGAGAAACTCTTCTGCTGTCACCTTGAATGAACTCAGGAATAGCTTCCCTCTGGACT 
GTAGAGGAGCTAACTGTTTGGAACAGAAAACTGCTGGCTGTTGATTTTGTCTGGTTCCTTTGCCAACATCTGGGCACACCCTTT 
GCCCAGACACGAGTGGGGAAAGCAGTTCTTTCTCCTCAGTTTCCAAAGTAAATGGGGAATCCCAGCTTTCTTTTCTACTAGCAA 
ATGACCCTACCATTTATTTCTGCCTTTTTCTTCCGTTCATTGTGAGGAAAAATAAAACTGGTTGAGAGCTTTGTTGTACCAAAA 

40 AAAAAAAAAAAAAAA 

29 

RSHFICDLDFFVLHYIFIDFVINMVLG^ 

SRNAGSNTAYTQNYTEEVEYFNHKDILIGHERDDDNSEEGFHTIHSGRHEYAFSFELPQTPLATSFEGRHGSVRYWVKAELHRP 
WLLPVKLKKEFTVFEHIDINTPSLLSPQAG 
QTQAFYAKGKMKEVKQLVMLRGESLSSGKTET^
LHPFGSRTSSVSSQCSMNMMLSLSLPER 
DPNPDQSADDRPSCPSR 
GACCACTGAGACGAGCGGGAGCGCGGAGCAGCAGCCTCTGCTGCCCTGACTTTTTAAGAAATCTCAATGAACTATTTGTAGAGA
ATCACTGATCCGGCCTGCAAGCATTTTGCACGGCAAAAATATCGATCAGTGTTAAGTGAAGATCACATTTTATATGCGATCTTG 
ACTTTTTTGTCTTACATTATATTTTTATAGATTTTGTTATAAACATGGTGCTGGGAAAGGTGAAGAGTTTGACAATAAGCTTTG 
ACTGTCTTAATGACAGCAATGTCCCTGTGTATTCTAGTGGGGATACCGTCTCAGGAAGGGTAAATTTAGAAGTTACTGGGGAAA 
TCAGAGTAAAATCTCTTAAAATTCATGCAAGAGGACATGCGAAAGTACGCTGGACTGAATCTAGAAACGCCGGCTCCAATACTG
CCTATACACAGAATTACACTGAAGAAGTAGAGTATTTCAACCATAAAGACATCTTAATTGGGCACGAAAGAGATGATGATAATT 
CCGAAGAAGGCTTCCACACTATTCATTCAGGAAGGCATGAATATGCATTCAGCTTCGAGCTTCCACAGACACCACTCGCTACCT 
CATTCGAAGGCCGACATGGCAGTGTGCGCTATTGGGTGAAAGCCGAATTGCACAGGCCTTGGCTACTACCAGTAAAATTAAAGA 
AGGMTTTACAGTCTTTGAGCATATAGATATCAACACTCCTTCATTACTGTCACCCCAAGCAGGCACAAAAGAAAAGACACTCT 

GTTGCTGGTTCTGTACCTCAGGCCCAATATCCTTAAGTGCCAAAATTGAAAGGAAGGGCTATACCCCAGGTGAATCAATTCAGA
TATTTGCTGAGATTGAGAACTGCTCTTCCCGAATGGTGGTGCCAAAGGCAGCCATTTACCAAACACAGGCCTTCTATGCCAAAG 
GGAAAATGAAGGAAGTAAAACAGCTTGTGGCTAACTTGCGTGGGGAATCCTTATCATCTGGAAAGACAGAGACGTGGAATGGCA 
AGTTGCTGAAAATTCCACCAGTTTCTCCCTCTATCCTCGACTGTAGTATAATCCGCGTGGAATATTCACTAATGGTATATGTGG 
ATATTCCTGGAGCTATGGATTTATTTCTTAATTTGCCACTTGTCATCGGTACCATTCCTCTACATCCATTTGGTAGCAGAACCT 

CAAGTGTAAGCAGTCAGTGTAGCATGAATATGAACTGGCTCAGTTTATCACTTCCTGAAAGACCTGAAGCACCACCCAGCTATG
CAGAAGTGGTAACAGAGGAACAAAGGCGGAACAATCTTGCACCAGTGAGTGCTTGTGATGACTTTGAGAGAGCCCTTCAAGGAC 
CACTGTTTGCATATATCCAGGAGTTTCGATTCTTGCCTCCACCTCTTTATTCAGAGATTGATCCAAATCCTGATCAGTCAGCAG 
ATGATAGACCATCCTGCCCCTCTCGTTGAAGGAACACTTGGTTGAATCAAGTTGATGTGGGTTCCGAACTGTATCTCTTCCGGC 
TGAGGACAGAGAAGTATCTTGGAGACACGTTTCAGAGGAAGTGGAATTACTTTTGCCCAGAAAAATGGCGAATACATGAAACAA 

CCAGTGATCATGCTTTAGAAGCCTACAGCAACATTCTGAGACTGCTCCAACATGCTTGAAGATCTAAGCTTTTCTCTTTTAAAA
CTGGCACATACTCAGAGCAGTCTTCTTAGCCTATGGTCGTACGTGTCAAGACATGACGTTGTAAAGAGGGATGATTTCCTTCTT 
TTGATTTGAAAATTTGCACATGCTCAATGCTTACATTGTGCGGTTCGACGTCACTACAGCTTCTTTTTTTTTTTTTTTTTTTTT 
CTATTTTTGCCAGACTCTTGATACTCTTAAAACTTGTTTGTGGTCAGCACAACAAGGAACAAAACAAAGCTTTGAAAAAACTTT 
AACATGAAAAAACGCACTGACATTTTTTTTTATTTAATATAGCCTGGACTTTACCTGCGTATGCACATGCTCAGAATTGTCTAC 

TAGGCTGACTATGTATCACCTCTTCAGCTTGGATCCAATTGTGGATTTATTTACAAACATCAAATGCCTTCAAGCCAATCCTTT
TTGCTGTATGTTTTGCAGCCTACTGTAGTAGATACGCAACAGATAATGTGGGAAAAAAAGAGATAAGAGGAGGAAGCTAATAAG 
AGACTGTCAAGATTGTATACCTTCTTGGTTTCTTTTAAGAATTTGTTGCCTTTCTACTATTACAGCAAAGCAGCATTTTGTTAC 
TGACTGCCTAAAATCACTTAATCTCAGGTGAACGCATCACTTGCCAAACTGTTGGAATGCTATTTGTGTTTTGTTGCACTGTTT 
TTTTCGTTTGTTTGTTTGTTTATTTGGTTGGCTTTTTGGAGAGGGAAATTTGGAAACGGGACATACACAAAAGTTACACACCCA 

CATTCCCTTTTTATCATGACATACAAGAAGAAACTAGCAGAGCTAAGAATGGAGTGAAGAAAGGCAGTATGGCAGGCACCAGCA
AAGAGTTGAGGGCTGTTGCTCTTAAAAATTATTTTTTTTATTATTATTTTGAAAGTATGGAAGTTTTCCATTCACTGGGGAAAG 
GAGGGAAAAGTGCATTTATTTTTATACAGAGTTACTTAATTACCTCCAAAACACATATGTTGGAAATCGCTTTTGCTGGTGCAA 
AGTATATTAATGAGCAGGAATACATACATTGAGGTTATGAATAGAGAGCTCAATTTGTACCTTTGCTGTCTTGCTCAAGCTTGG 
TATGGCATGAAMCTCGACTTTATTCCAAAAGTAACTTCAAAATTTAAAATACTAGAACGTTTGCTGCGATAAATCTTTTGGAT 

TTTTGTGTTTTTCTAATGAGAATACTGTTTTTCATTACCTAAAGAACAATTTGCTAAACATGAGAAATCACTCACTTTGATTAT
GTATAGATTACATAGGAAGAACAATCACATCAGTAAGTTATAGTTTATATTAAAGGTAATTTTCTGTTGGCTCATAACAAATAT 
ACCAGCATTCATGATAGCATTTCAGCATTTTCCAAGGTACCAAGTGTACTTATTTTGTTGTTGTTGTTGTTGTTGTATTTTAGA 
AGGAATTCAGCTCTGATGTTTTTAAAGAAAACCAGCATCTCTGATGTTGCAACATACGTGTAAAATGGGTGTTACATCTATCCT 
GCCATTTAACCCCACAGTTAATAAAGTGGCTGAAAATAATAGTAGCTCTGGCTTGGTGCTTGACCTGGTTAAATACTGTCTTAA 

AGCTCATACAAAACAAATAGGCTTTTCCATAAGTGGCCTTTAAGAAAACATGGAAGAGAATTCATGTTTGACAAATGCTGACAG
GGTGAAGAAAGCCCAGTGTAAAAATGAATCGCGTTTTAAGTGATTCGGTTAAAGAGTTTGGGCTCCCGTAGCAAACTAATACTA 
GATMTAAGGAAATGGGGGTGAMTATTTTTTTATTGTTGAATCATTTTGTGAATGTCCCCCTCAAAAAAAGCTMTGGAATAT 
TTGGCATAAAGGGCATTTGGTGGTTTTATTTTTGTTTGAGGGGGATTGTCAGAAAATCCCTTTTCTCTCTTACGTCTAACTGAC 
TAGGGAACAATTGTTGATATGCATAGCATTGGAATACTTGTCATTATATACTCTTACAAATAACACATGAAGCAAGAATGACCA 

ATATTCTGATAATTGGCACTGGATCACAAAATGTGATAAAACTTTAAATGTATAAAACTTTATCAAATAAAGTTTTATTTTCCC
CTTTAAAATGTATTTCTTTAGAGGCATTACTTTTTTAAAAATATTGGTCAATTCCTGACATAAGATGTGAGGTTCACAGTTGTA 
TTCCAGTATTCMGATAGATTCCTGATTTTTCAATTAGGAAAAGTAAAATCCAAAATGTTAGCAAAACAAAGTGCAATATTAAA 
TGTTTGCTTTATAGATTATATTCTATGGCTGTTTGTAATTTCTCTTTTTTTCCTTTTTTATTTGGTGCTGAATATGTCCTTGTA 
GGCTCTGTTTTMGAAAACAATATGTGGGAAATGATTTAATTTTTCCTATTGCTCTTCCTTGTGGAAAATAAAGTGTTTTGTTT 

TTTTCTGTTTTGTAT
31

MLGKGGKRKFDEHEDGLEGKIVSPCDGPS 

MFTPSSQPTTEPSDSYREAPPAFSHLASPSSHPCDLGSTTPLEACLTPASLLEDDDDTFCTSQAMQPTAPTKLSPPALLPEKDS 
FSSALDEIEELCPTSTSTEAATAATDSVKGTSSEAGTQKLDGPQESRADDSKLMDSLPGNFEITTSTGFLTDLTLDDILFADID 
TSMYDFDPCTSSSGTASKMAPVSADDLLKTLAPYSSQPVTPSQPFKMDLTELDHIMEVLVGS

32 

CTCCTGCACGGCGAGTGCTGGAGCACGAGCTACCGCTCGCTCGGTCAGGGCGCCCCCTCCGCCCGCCTCCTGCTTCCTCCTCCG 
CTGCCTGCCGCCGCCGCCTCCACCATTGTATAATGCTCGGGGCGCGCAGGCAGAGAACGGCGGAGTCTTAGCTTCAGCCTCGCC 
TGCTGCCCGCTCCCCGGCGCCACCCTCGGGCCCCTGGAGCGGGGCACTCCGCATGGAGCGGGAGTAGCTGAGGAGTGGGCGGAA 

ACCCCTCCTGATGCGTTAGTTCCCAGGTGGAGCTGCATGTGATATATGTTGGGTAAAGGAGGAAAACGGAAGTTTGATGAGCAT
GAAGATGGGCTGGAAGGCAAAATCGTGTCTCCCTGTGACGGTCCATCCAAGGTGTCTTACACCTTAGAGCGCCAGACTATCTTC 
AACATTTCCCTTATGAAACTCTATAACCACAGGCCCCTGACAGAGCCCAGCTTGCAAAAGACCGTTTTAATTAACAACATGTTG 
AGGCGGATCCAGGAGGAACTCAAACAGGAAGGCAGCCTGAGGCCCATGTTCACCCCCTCCTCCCAGCCCACCACCGAGCCCAGC 
GACAGCTACCGAGAGGCCCCGCCGGCCTTCAGCCACCTGGCGTCCCCGTCCTCCCACCCCTGCGACCTCGGAAGCACTACGCCC 

CTGGAGGCCTGCCTCACCCCGGCCTCACTGCTCGAGGACGACGATGACACGTTTTGCACCTCCCAGGCCATGCAGCCCACGGCT
CCCACCAAACTGTCACCTCCAGCCCTCTTGCCAGAAAAGGACAGTTTCTCCTCTGCCTTGGACGAGATCGAGGAGCTCTGTCCC
ACATCTACCTCCACAGAGGCGGCCACGGCTGCGACTGACAGTGTGAAAGGGACCTCCAGCGAGGCTGGCACCCAGAAACTCGAC 
GGTCCTCAAGAGAGCCGCGCAGATGACTCAAAACTGATGGACTCTCTGCCTGGGAATTTTGAAATAACGACGTCCACGGGTTTC 
CTGACAGACTTGACCCTGGATGACATCCTGTTTGCTGACATTGATACGTCCATGTATGATTTTGACCCCTGCACTTCCTCATCA 

GGGACAGCCTCAAAAATGGCCCCTGTGTCTGCCGACGACCTCCTCAAAACTCTGGCTCCTTACAGCAGTCAGCCTGTCACCCCA
AGTCAGCCTTTCAAAATGGACCTCACAGAGCTGGACCACATCATGGAGGTGCTTGTTGGGTCCTAAGACCCAGGGACCCAGCGA 
CTATGCCCACCCAGACCCCAGAGCGTTCCCATAACCCTGACAGTTCTCCACACTGTGCATGCACCCTTGCTTGCCTTTTTCAGA 
GAAAAAGAAAATTTTACAACAGGATCACACTAGTTTTTGCTTTGAGCAGAGTTGGAGTGCCTTCATCCAAGTATGACCACTTTT 
AATACACTTTTTTGAGTGGTTCCTCAGAGACCTACTACCCTGGTATAGGAAAGAATCCATTTGAAGACAATGTTGCAATGTTGA 

ATGACAAAAATAAACAGTTCAAGTGAAGCACAAGGATTAAGTTGGAAAAGCTGTAAATTGCATGTGCATATTTGTCTATTTTTT
CTATAAGTTTTATTGCAAGAGGTAAAGAAGAAAACTATATATATATATCTTATTTAGATAATCTCAGTACCTTTTCTGGCATTT 
TTGCCCTGTATAGGTTGACTTGGCAATTCGGCCTTTTTAGAGGCATTAACTACTCCTCGTAAGTGTTGCATTTACATGGCTGTT 
TAGAAAACTGCTGCCCAAATTTATTTTATATTTTTGTACAGATTCTGCAGTTTATGATATTGTTTTTCTAAAAACAAATGCTGT 
TTATACATATGAGATAGCTATTTTGATAGGATTTGCTCACATAGTTCCTGCAAACTTCAGATGTACAAGTTGCACTTGTACTTT 

TATAGAGTTGTAATGTTTTATATGTGTATGGTGCAAGAGAAAATTGGATCAAATCAATCTGCAGTTGATGTCCCCAAATGCAAA
CACAGGCACACACATGCACACACCCATAAACACACACACAGTGCTTTAAGAAAGGGCCAGGTGATATCACACCCAAATTTCACA 
AGCACTGACCCCCTGGCACCAACACCCGCCAGTACTGTGACTTCCAAAGCCAGAGCCACATGTGCTCATCAAACTTGCATTAAG 
CAGTTGGCGGGAGATGGCTGTGGAGCTGGGGGTTTAAGTGATGGTTCTCTTTTGCTCCCTCTTTTGAGGGTAAAGCTACTGTCT 
TTCTTAAGAGTGTATTTATGCCAAGTTTGCGCTTTTAATTGTTTTTATTTTGTTTTTTAATGAAAACCCAGATCTTTCCTTTTT 

GGCATAATTTTTATGATGACCTGAAATTTTACATCCGAACAAAATTTTACATCCGAAAAGCAACCAACTTCTTCATGGAACTCA
GCCCTGTTGCAATGCTTAGGGCCCTTAAAGAAGAAAATCTCCCCAGAAGGCATCCATCATGTTGCTTAATTGTCTTCTGCAGCT 
TCCTTTCCCTAGAGCTTTCCCTGTGTTGCTAAGAGCTGAAAATGGCATCTTCGTGATCACCACAGTGAGCTTGGCTCGCCTCGG 
CCGGCCCGGGATGCACTCTTACAACATGTGTGACTCTTGAACCTGGAGTTCATCACATTACGTCACAGCTTCCCATCTGGTTGC 
TTTCCTGAGTCAGCTACTTCACACTTGTCAAGGCTGTTTTACCCCAAAACTCAGACAGGACTTTGTATGCATGTTTTCCCTCCT 

CCCCCCAATTCCCCCCCCATCACCTTATCTCCCAGGACACACTTGAGAAGTAGCTTTTTATTCCTAGTGGTGTACATTTAATTT
TAAAAAGGTTGCAATGTATCATGCTTGTTGCCGAAACTGTTTATGGCCTTCTTGTTTCAGTTTTTTCTTTTCTTCCAATGGTAC 
TTTAGCTGTTGAGTGCAGGTTACAACCTATATTGTTATGCAGATGGCTTCTTTAGGAATAACTTTTATATTTATTTAAAAATTT 
TTAAATTATGGGATGTTTTGTTGTTGTTGTTGTCTTTGTTGTTGGTCATTTGTCAATATTCAGTCACCAATTCTGCTCACTTCT 
TGCCATGGATAAAATTGGGTCTTTCTGGCTAATTAAAAM^ 

TATTTTTGTAATGCACATGGCAAAGCAAAGACGTTTGTGATGAAGGAACTGCTCATCTAAGCAAAAGATTTGAGTATGATATGA
TAAAGGCTTTCTACATTCTAATTTACTTTTTCCCCCCACTTGAATGTGTTTTAAAGGCTAATTATCAGCTCAGTAGAGCAGTGA 
GAAACTGATCAAATTGCACTTGTTCTCCTACAAGCAACCTCCACGCAGACACCTCGTACTGCTACAGGTGTGTCATTTCCTTTA 
ATAGGACCAGGGACCATGTAACTGAGGTGAGGGTTGTAGTAGATGCTTCCAGTGTCAGTATGCCTGTTAATTTTAAGAGCTTCC 
CTTTCTTGCAGAGAACAAGTCTGCCCAGATTCCATGCTTTCTATAACTGGAGGACCTGGCAAACCTGCCGCATGCTGCACACAT 

CTACCTACGTACACATATACAATAGTATTGATGATTCTGAACAATAACAGGGTAAAACAGTTGGTTTGCCATTGTTAAAAACTG
ATTTACAGTAACTTACAACAACTGTACTTTTGTTGGATTAGCAAATCATGTGTTTAAACAAATCCCATATGTTGGGCAACAGTT 
CAAATAAGCACGGAGAAGTGTTGCCCAAACTTGGTTCTCTGACTCTTATGTATTTGTAAGGCTGGGCTTCAAAATCAAAACAAA
AACCCCAAAAACAGCAGGCAAATGCTTTTTAACTCTGACACCGTTGCCATAAATCCCTGATACTCAAAGTCTAACAAGAAAGAC 
ATGGAAAATTAGCAGCCCATTTTCAGAAAGATCAAAATGATCTAGGGTTCTAATTGCTTTTGCATCCTATTCTTACAAAGTGAT 
GTCCCAACAGGGAACAGTAGGAGCTGGAGTGGGATCTCCAAGTCCCAGTTTGAGTGTGGGATGTGCTTCCAGCAGTGCCTTCCC 
TTTATGAAAGACATCACATGGCATCCAGGGCCAGGCAGGCAGCTTGAGGTGCCTTTACGAGAAAACCGAGCTGGGGCTGGGAGA
GGACAGTTATTGACACTGATGTGCAATGAAGTGACAAGATGAGAGCAGAATCGTAAGAGCTTTGAATTTGAAGTGAGTTTTTTT 
CCCCCCATAAGTTATTTATTCCTTTTTTCTGTGTAAATATATTTATTTTACTGTGGAGCGCTAACATCTGGATCGTAACATGTG 
CAGAATGTATGGTAGGAATGTATTCTCTTGTAGGAATGTAAATCTGTATTAAAAGGGGGTCCAAGCCAGGCCCCCAGGTCTTCT 
CATTGTATGCACAGTCCGCATTCATTTTTACTCTTCTCTAATATGGGTCTATTTGAAATATGCAAAAGGTATGAGGAATGTTTT 
AATACCTCCAAATTTTTAAGAAAAGCATCAAAGGGTTGATATTTTTTAAAGTTTTTTTAGTAGCACTTTCTCTGGATGACAGAA
GGGGCAACCACATGGGCACCCTTGTTCATACCAAAGGGTGAGCAGTGGCCAGAGCCTCCTCTGCACCTCTCGAGTGTCTTTACC 
AATTGAGCTTTTTATCGCCATAGCCCCTTGGAGTGCCCCAGCTGCCCTGAGGTCAATCAAGGAAAATTTCTTAATGAAATAAGC 
TCCAAAGAGCCAAAGTATCAACTTACAGATCGTTTTTAAAGCTTAAATTTATGAACCACCTTTGTGGTAAACAATGAATTATGA 
ATACCGCAGGGCAGCCTTCTTAAATGACAAATGTAAAAAAAAAAAAAAAAGACTCTACTTCGTGCAGCAATTGCTACTCTATAC 
GAATTGTCTTAATTTGAAAACCTTGCTGTTACAAATTGGACCTTTATACATTTTCTGAAMCAATGAAAAGAGTATATTTAACC
TTTTCTGGCTGTAAATGGTTACCTTCCTGTAACTGCCCCGCACCTGGAGGCATGGAGTTGTGTGCATCCTGCTTATGTACAATT 
GTTTTCAGTGTTTCTAAGAATGAGTCTGAATGGTTCTTGAAAATTAGCCAGGATCAAATGCTATTGCAGACAAAGCCAATAAAA 
AGTTGGACTTCTTTTGGGGATAACAAGTTTTGGAAGAGAAATGCAGGCCATATGTGCGCATGACCGAGATTTTGAAAAAAGATG 
TACATAGTGACATGTTTGGTGCATGGTTTTTGAGGAGGGCTTTTGTCAAAAAGGAGGTATAACCTTTCCCCCACAGACCTGAGA 
GCTGTGCCTTTTCTATGCAATATTACAGACGTTACATCGGAACCCAGATGGCTGTATTCACATGTAGGTTTGGGCTGTAATCTA
AACAATTGGACAGATTAAATGTACATGGAAATGAGCAGTCTTACOT 

33 

MTSRDNYKAGSREAAAAAAAAVAAAAAAAAAAEPYPVSGAKRKYQEDSDPERSDYEEQQLQKEEEARKVKSGIRQMRLFSQDEC 
AKIEARIDEWSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPE 
GFWSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAADEITHCI
RPQDIKERRAVIILRKTRLDAPRLETKSLSSSVLPPSYASDRLSGlSINRDPALKPKRSHRKADPDAAHRPRIIiEMDKEENRRSVL 
LPTHRRRGSFSSENYWRKSYESSEDCSEAAGSPARKVKMRRH 

34 

AAGTGTTACTTCTGCTCTAAAAGCTGCGGAATTCCTCGAGCACTGGCGGCCGCCAGCGGCTACACGGACCTGCGTGAGAAGCTC

AAGTCCATGACGTCCCGGGACAACTATAAGGCGGGCAGCCGGGAGGCCGCCGCCGCTGCCGCAGCCGCCGTAGCCGCCGCAGCC
GCAGCCGCCGCTGCCGCCGAACCTTACCCTGTGTCCGGGGCCAAGCGCAAGTATCAGGAGGACTCGGACCCCGAGCGCAGCGAC 
TATGAGGAGCAGCAGCTGCAGAAGGAGGAGGAGGCGCGCAAGGTGAAGAGCGGCATCCGCCAGATGCGCCTCTTCAGCCAGGAC 
GAGTGCGCCAAGATCGAGGCCCGCATTGACGAGGTGGTGTCCCGCGCTGAGAAGGGCCTGTACAACGAGCACACGGTGGACCGG 
GCCCCACTGCGCAACAAGTACTTCTTCGGCGAAGGCTACACTTACGGCGCCCAGCTGCAGAAGCGCGGGCCCGGCCAGGAGCGC 

CTCTACCCGCCGGGCGACGTGGACGAGATCCCCGAGTGGGTGCACCAGCTGGTGATCCAAAAGCTGGTGGAGCACCGCGTCATC
CCCGAGGGCTTCGTCAACAGCGCCGTCATCAACGACTACCAGCCCGGCGGCTGCATCGTGTCTCACGTGGACCCCATCCACATC 
TTCGAGCGCCCCATCGTGTCCGTGTCCTTCTTTAGCGACTCTGCGCTGTGCTTCGGCTGCAAGTTCCAGTTCAAGCCTATTCGG 
GTGTCGGAACCAGTGCTTTCCCTGCCGGTGCGCAGGGGAAGCGTGACTGTGCTCAGTGGATATGCTGCTGATGAAATCACTCAC 
TGCATACGGCCTCAGGACATCAAGGAGCGCCGAGCAGTCATCATCCTCAGGAAGACAAGATTAGATGCACCCCGGTTGGAAACA 

AAGTCCCTGAGCAGCTCCGTGTTACCACCCAGCTATGCTTCAGATCGCCTGTCAGGAAACAACAGGGACCCTGCTCTGAAACCC
AAGCGGTCCCACCGCAAAGCAGACCCTGATGCTGCCCACAGGCCACGGATCCTGGAGATGGAGAAGGMGAGAACCGGCGCTCG 
GTGCTGCTGCCCACACACCGGCGGAGGGGTAGCTTCAGCTCTGAGAACTACTGGCGCAAGTCATACGAGTCCTCAGAGGACTGC 
TCTGAGGCAGCAGGCAGCCCTGCCCGAAAGGTGAAGATGCGGCGGCACTGAGTCTACCCGCCGCCCTCCTGGGAACTCTGGCTC 
ATCCTTACGTAGTTGCCCCTCCTTTTGTTTTGAGGGTTTTGTTTTTGTTCATTGGGGGGTTTTTGTTTTTTGTTTTTTGTTTTT 

TTTGATTCTATATATTTTTCCTTGGTTTTGTTGCCTGTTAGGGCTGAAGAATAGAATTGGCCAGGACCTAGGTTCTCATATTCT
TGGTATTCCTCCTGGATGGAAAGGCTGTTGGCATCAATAGGGGACAGAGGCTGATGCTGGAGTGGCCAGTAGAGGTGGTGGAGC 
AGAGCAGCCATCTTTTAAGTGGGGCTGTATCAGGCTGGGTTTATTTAAAAGCAACAAAATGTTTTGGTTAAGAAAATTATTTTG 
CTTTCAGTGTAAATCTTCGCAGTGTTCTAAACAAAGTTCAGTCTTCTGCTCGCCCCTTTCCCTCACTGATGTCTGCACTTGGTT 
GAGGTCTCCTGGAGCCTCACAGGCTCTGCTGTTCTCCACTTCTCACCTGCCATCCACGCCCTGCAAGCTCATGCAAACACCCTT 

TCTTCCTCCTGCGGCAGAGTTGTTCAGGTTGCCTGGGCAGGGGCTTAAACAGTGCCAGCCCCTGCCATCCCAAAGCTATTGTTA
AGCCCCCCAGGCGTCCTCCACCCACGCCCACTAGCCTGCCATGTCCACAGTTCCTTGGGCTGCTGAGGGGCTAGTGCAGTGGTC
CTGACCTCTCTTATCAAGAGCACACTTCTTTGCTGGTTGCTCCTTTTGAGCATATGCGTGTGATTATTTGGAACAGTTAGACTT 
GCCACGTTGGGTCAGTTTTAGAAATTGTTTCTAGCTAGAGGGACTGGTGTCCTTCCAAGTCTAGCATTTGGGGTATGGAAAATT 
GTTGTGGTGTGTGGTAGGGTTTTTGTTTTCTTTTTTGAGTTTTTTTTCCCCCTTTAGTCTCCTGGCTTTTTCCTTTCCCTTCCC 
TTCTCCACTGGCCAGCTTGGGCCTCATCCTCATGTCATCCTTCTAGGAAGGCGCCTGCCCCATCTTGTCTGCCGGCAGCATGCA
TCCAAGGCCAGAGCTCAGGCCTGCAGACTGGGCTGGTGCCTCCTCCGCTTCAGGGTATGGGAGTTGGTGAAGGGGCTTTCAAAA 
AAAAAAAAAAAAAAAAAAA 

35 

MMTWLELDPIFLKALGFLHSKSKDSAEK^ 
KVLTTEKVKKEAEKRPADKMKSDITEGVDIPKKPRLEKPETQSSPITVQSSKDLPMADLSSFEETSADDFAMEMGLACWCRQM
MVASGNQLVECQECHNLYHRDCHKPQVTDKEANDPRLVWYCARCTRQMKRMAQKTQKPPQKP 

LKQETTFLAFKRTEVKTSTVISGNSSSASVSSSVTSGLTGWAAFAAKTSSAGPSTAKLSSTTQNNTGKPATSSANQKPVGLTGL

ATSSKGGIGSKIGSNNSTTPTVPLKPPPPL^ 

TSESSSSPSASLKGPTSQDHSSML 

36

GTGAGTTGACCAGAGAAGGATGAGAGAATTGCCAAGTTGTACTGAACGCCAAGCTGAGAATGGTGATTCTGAAGATAGAATGCG 
TTTGCAATGGCTGCTACTGTGAACTTGG^^ 

TCTGCTGAAAAGCTAAAAGCACTGCTTGATGAATCTTTGGCTCGGGGCATTGATTCCAGTTACCGTCCATCTCAAAAGGATGTG 
GAGCCACCCAAAATTTCAAGCACAAAAAACATTTCCATTAAGCAAGAGCCCAAAATATCATCCAGTCTTCCTTCTGGTAATAAT 

AATGGCAAGGTCCTCACAACTGAAAAGGTAAAGAAGGAAGCTGAAAAGAGACCTGCTGATAAAATGAAATCAGACATCACTGAA
GGAGTTGATATTCCAAAGAAACCTAGATTGGAGAAACCAGAAACACAGTCATCTCCCATTACTGTCCAAAGTAGCAAGGATTTA 
CCTATGGCTGACCTTTCCAGTTTTGAGGAGACCAGTGCTGATGATTTTGCCATGGAGATGGGATTGGCCTGCGTTGTTTGTAGG 
CAAATGATGGTGGCATCTGGCAATCAATTAGTAGAATGTCAGGAGTGCCATAATCTCTACCACCGAGATTGTCATAAACCCCAG 
GTGACAGACAAGGAAGCGAATGACCCTCGCCTGGTGTGGTATTGTGCCCGATGTACCAGACAAATGAAAAGAATGGCTCAAAAA 

ACTCAGAAACCACCGCAGAAACCAGCCCCTGCAGTTGTTTCTGTAACTCCAGCTGTCAAAGATCCATTGGTTAAGAAACCAGAA
ACTAAACTGAAACAAGAGACAACTTTTCTAGCGTTTAAGAGAACAGAAGTCAAGACATCCACAGTTATTTCAGGAAATTCTTCT 
AGTGCCAGCGTTTCCTCGTCAGTAACTAGTGGCTTAACTGGATGGGCAGCTTTTGCAGCCAAAACTTCCTCTGCTGGTCCTTCA 
ACAGCAAAATTGAGTTCAACAACACAAAACAATACTGGGAAACCTGCTACTTCGTCAGCTAACCAGAAACCTGTGGGTTTGACT 
GGTCTGGCAACATCATCCAAAGGTGGAATAGGTTCCAAAATAGGTTCCAATAACAGCACTACGCCCACTGTACCTTTAAAACCA 

CCTCCACCTCTAACCTTGGGTAAAACTGGCCTTAGTCGCTCAGTTAGTTGTGACAATGTCAGCAAAGTAGGTCTTCCTAGTCCA
AGTAGTTTAGTTCCAGGAAGCAGCAGCCAACTAAGTGGGAATGGAAATAGTGGAACATCAGGACCTAGTGGAAGTACTACCAGC 
AAAACTACTTCAGAATCCAGCAGCTCTCCCTCAGGATCCCTTAAAGGCCCAACTTCACAAGATCACAGCTCAATGCTATGAAGC 
GATTACAGATGGTCAAGAAGAAAGCTGCCCAAAAGAAACTCAAGAAGTAATGTGGCCAAGTAGGTTTTTGTATCATATTAGCCT 
AAAGATGAMGGCTTATTATTATGATATMTC^ 

AA

37 

VDSMELLFLDTFKHPSAEQSSHIDWRFPCW 

LGSLEYDENTSIIFRPNSKVNTDGLVLRGWYNCLTLAIYGSVDRVISHDRDSPPPPPPPPPPPQPQPSLKRNPKHADGEKEDQF 
NGSPPRPQPRGPRTPPGPPPPDDDEDDPVPLPVSGDKEEDAPHREDYFEPISPDRNSVPQEGQYSDEGEVEEEQQEEGEEDEDD 

VDVEEEEDEDEDDRRTVDSIPEEEEEDEEEEGEEDEEGEGDDGYEQISSDEDGIADLERETFKYPNFDVEYTAEDLASVPPMTY
DPYDRELVPLLYFSCPYKTTFEIEISRMKDQGPDKENSGAIEASVKLTELLDLYREDRGAKWVTALEEIPSLIIKGLSYLQLKN 
TKQDSLGQLVDWTMQALNLQVALRQPIALNVRQLKAGTKLVSSLAECGAQGVTGLLQAGVISGLFELLFADHVSSSLKLNAFKA
LDSVISMTEGMEAFLRGRQNEKSGYQKLLELILLDQTVRWTAGSAILQKCHFYEVLSEIKRLGDHLAEKTSSLPNHSEPDHDT 
DAGLERTNPEYENEVPASMDMDLLESSNISEGEIERLINLLEEVFHLMETAPHTMIQQPVKSFPTMARITGPPERDDPYPV 

YLHSHHFLELVTLLLSIPVTSAHPGVLQATKDVLKFLAQSQKGLLFFMSEYEATNLLIRALCHFYDQDEEEGLQSDGVIDDAFA
LWLQDSTQTLQCITELFSHFQRCTASEETDHSDLLGTLHNLYLITFNPVGRSAVGHVFSLEKNLQSLITLMEYYSKEALGDSKS 
KKSVAYNYACILILVWQSSSDVQMLEQHMS^ 
GVGLTTALRVLCWACPPPPVEGQQKDLKTOL^ 

TLLKTMLTELLRGGSFEFKDMRVPSALVTLHMLLCSIPLSGRLDSDEQKIQNDIIDILLTFTQGWEKLTISEETLAMTWSLM 
LKEVLSSILKVPEGFFSGLILLSELLPLPLPMQTTQVIEPHDISVALNTRKLWSMHLHVQAKLLQEIVRSFSGTTCQPIQHMLR
RICVQLCDLASPTALLIMRTVLDLIVEDLQSTSEDKEKQYTSQTTRLLALLDALASHKACKLAILHLINGTIKGDERYAEIFQD 
LLALVRSPGDSVIRQQCVEYVTSILQSLCDQDIALILPSSSEGSTSELEQLSNSLPNKELMTSICDCLLATLANSESSYNCLLT 
CVRTMMFLAEHDYGLFHLKSSLRKNSSALHSLLKRWSTFSKDTGELASSFLEFMRQILNSDTIGCCGDDNGLMEVEGAHTSRT 
MSINAAELKQLLQSKEESPENLFLELEKLVLEHSKDDDNLDSLLDSWGLKQMLESSGDPLPLSDQDVEPVLSAPESLQNLFM
RTAWLADVMDDQLKSMWFTPFQAEEIDTDLDLVKVDLIELSEKCCSDFDLHSELERSFLSEPSSPGRTKTTKGFKLGKHKHET 
FITSSGKSEYIEPAKRAHWPPPRGRGRGGFGQGIRPHDIFRQRKQNTSRPPSMHVDDFVAAESKEWPQDGIPPPKRPLKVSQ 
KEQEAPVGVLRILLEEITMKWEARAILTEALFHHYDPLVLQVTAQVLGTVLLEWGDLDLPGLVQIAAVEAQEESLLVEAWE 
WMYAPLHDKNPFGNILTVYEHFTRTIKIRH 

38

GGTGGACTCGGCGATGGAGCTGTTATTTTTAGATACTTTTAAACACCCGAGCGCTGAGCAAAGTTCTCATATAGATGTGGTTCG 
TTTTCCATGTGTGGTTTATATCAATGAAGTCCGAGTCATACCCCCAGGAGTAAGAGCCCATAGCAGTCTGCCAGACAATAGAGC 
ATATGGAGAGACATCTCCCCATACATTTCAATTAGACTTATTCTTCAACAATGTAAGCAAACCAAGTGCCCCTGTTTTCGATAG 
GTTGGGAAGCCTGGAATATGATGAGAATACTTCCATCATCTTTAGACCTAACTCAAAGGTGAATACTGATGGTCTGGTGCTAAG 

AGGCTGGTATAACTGTCTGACACTGGCAATATATGGATCAGTGGATAGAGTGATAAGTCATGACAGAGACTCTCCACCACCACC
ACCTCCACCGCCACCACCTCCCCAGCCACAACCAAGTTTGAAAAGGAATCCAAAACATGCTGATGGGGAGAAAGAAGATCAGTT 
TAATGGAAGCCCTCCAAGACCACAGCCAAGGGGACCAAGAACTCCTCCAGGACCCCCTCCACCTGATGATGATGAAGATGATCC 
TGTGCCTCTGCCAGTGTCTGGTGACAAGGAAGAGGATGCTCCTCATAGAGAAGATTACTTTGAGCCCATTTCTCCTGATCGGAA 
TTCTGTTCCCCAGGAAGGGCAATATTCTGATGAAGGAGAAGTAGAAGAGGAACAACAAGAAGAAGGAGAAGAAGATGAAGATGA 

TGTGGATGTAGAGGAAGAAGAGGATGAGGATGAGGATGATCGACGAACAGTAGACAGTATTCCTGAGGAGGAAGAGGAAGATGA
AGAGGAAGAAGGTGAAGAGGATGAAGAAGGTGAAGGGGATGATGGTTATGAACAAATTTCCAGTGATGAAGATGGAATTGCTGA 
CTTGGAACGTGAAACATTTAAGTATCCAAACTTTGATGTTGAATACACTGCTGAAGACTTAGCTTCAGTTCCTCCTATGACATA 
TGATCCATATGACAGGGAGCTTGTACCACTCTTATACTTCAGTTGTCCATACAAGACTACTTTTGAAATTGAAATCAGTAGAAT 
GAAGGATCAAGGTCCAGATAAAGAAAATTCAGGGGCAATCGAAGCCTCAGTGAAGTTAACAGAACTCTTAGATTTGTATAGAGA 

AGATAGAGGTGCAAAATGGGTAACAGCTTTAGAAGAAATTCCAAGTTTAATAATAAMGGGTTAAGCTATTTGCAATTGAA

CACAAAACAAGACTCCCTTGGCCAGTTGGTAGACTGGACCATGCAAGCTTTAAATTTACAAGTAGCGCTTCGCCAACCTATCGC 
CTTAAATGTTCGACAGCTCAAAGCTGGGACCAAATTAGTGTCCTCACTAGCAGAATGTGGGGCTCAAGGAGTTACAGGACTGCT 
ACAAGCAGGAGTGATCAGTGGATTATTTGAACTTCTGTTTGCTGATCACGTATCATCTTCTCTTAAGTTAAATGCTTTTAAAGC 
TTTGGACAGTGTCATTAGTATGACAGAAGGAATGGAAGCTTTTTTAAGAGGTAGGCAGAATGAAAAAAGTGGTTATCAAAAGCT 

TCTGGAACTCATACTTTTAGATCAGACTGTGAGGGTTGTTACTGCTGGTTCAGCTATTCTCCAAAAATGCCATTTCTATGAAGT
CTTGTCAGAGATTAAAAGACTTGGTGACCATTTAGCAGAGAAGACTTCATCTCTTCCTAACCACAGTGAACCTGATCACGACAC 
AGATGCTGGACTTGAGAGAACAAACCCAGAATATGAAAATGAGGTGGAAGCTTCTATGGATATGGATCTTTTGGAATCCTCAAA 
TATAAGTGAAGGGGAAATAGAAAGGCTTATTAACCTCCTAGAAGAAGTTTTTCATTTAATGGAAACTGCCCCTCATACAATGAT 
CCAACAACCTGTTAAGTCTTTCCCAACGATGGCACGAATTACTGGACCTCCAGAGAGGGATGATCCATACCCTGTTCTCTTTAG 

ATATCTTCACAGTCATCACTTCTTGGAGTTGGTTACCTTGCTTCTGTCAATTCCAGTAACAAGTGCTCACCCTGGTGTGCTGCA
AGCCACAAAAGATGTTTTGAAGTTTCTTGCACAGTCACAGAAGGGTCTTCTTTTTTTTATGTCGGAATATGAAGCAACAAATTT 
ATTGATCCGAGCTCTGTGTCACTTTTATGATCAAGATGAGGAGGAAGGTCTCCAATCTGATGGTGTTATTGATGATGCATTTGC 
CTTGTGGCTACAGGACTCAACACAGACATTGCAATGTATTACAGAACTGTTCAGCCATTTTCAGCGTTGTACAGCCAGTGAAGA 
AACAGACCATTCAGATCTCTTGGGAACCCTGCACAATCTTTATTTGATTACTTTTAATCCTGTGGGAAGATCAGCTGTTGGCCA 

TGTTTTTAGTCTGGAGAAAAATCTCCAAAGTCTTATTACTCTAATGGAGTACTATTCCAAAGAAGCCTTGGGTGATTCCAAATC
TAAGAAGTCAGTAGCTTATAATTACGCATGCATACTTATTTTGGTGGTGGTTCAGTCTTCCAGTGATGTTCAAATGCTAGAACA 
ACATGCAGCATCTCTCTTGAAGCTTTGTAAAGCAGATGAAAATAATGCTAAATTGCAAGAACTTGGCAAGTGGCTTGAACCTCT 
GAAAAACCTTAGATTTGAAATTAACTGCATCCCAAACTTAATTGAGTATGTTAAGCAGAATATCGATAACTTGATGACCCCAGA 
AGGAGTTGGCCTTACCACTGCCTTACGTGTTCTCTGTAATGTTGCATGCCCACCACCTCCTGTTGAAGGTCAACAGAAAGATCT 

GAAATGGAATCTTGCCGTTATTCAGCTTTTTTCTGCTGAAGGAATGGACACGTTTATTCGAGTTCTGCAAAAATTGAACAGTAT
TCTGACTCAGCCTTGGAGGCTCCATGTCAACATGGGGACTACCCTTCACAGAGTTACTACTATTTCAATGGCTCGCTGCACACT 
CACTCTTCTTAAAACTATGTTAACGGAACTCCTGAGAGGTGGATCCTTTGAGTTTAAGGACATGCGTGTTCCTTCAGCGCTTGT 
TACTTTACATATGCTCCTGTGCTCTATCCCCCTCTCAGGTCGTTTGGATAGTGATGAACAGAAAATTCAGAATGATATCATTGA 
TATTTTACTGACTTTTACACAAGGAGTTAATGAAAAACTCACAATCTCAGAAGAGACTCTGGCCAATAATACTTGGTCTTTAAT 

GTTAAAAGAAGTTCTTTCTTCAATCTTGAAGGTTCCTGAAGGATTTTTTTCTGGACTCATACTCCTTTCAGAGCTGCTGCCTCT
TCCATTGCCCATGCAAACAACTCAGGTTATTGAGCCACATGATATATCAGTGGCACTCAACACCCGAAAATTGTGGAGCATGCA 
CCTTCATGTTCAAGCAAAGTTGCTCCAAGAAATAGTTCGCTCTTTCTCTGGCACAACCTGCCAGCCCATTCAACATATGTTACG
GCGTATTTGTGTTCAATTGTGTGACCTTGCCTCACCAACTGCACTTCTGATTATGAGAACTGTGTTGGATTTGATTGTAGAAGA 
CTTGCAAAGCACTTCAGAAGATAAAGAAAAACAGTATACTAGCCAAACCACCAGGTTGCTTGCTCTTCTTGATGCTCTGGCTTC 
ACACAAAGCTTGTAAATTAGCTATTTTGCATCTAATTAATGGAACTATTAAAGGTGATGAAAGATATGCAGAGATATTCCAGGA 
TCTTTTAGCTTTGGTGCGGTCTCCTGGAGACAGTGTTATTCGCCAACAGTGTGTTGAATATGTCACATCCATTTTGCAGTCTCT
CTGTGATCAGGACATTGCACTTATCTTACCAAGCTCTTCTGAAGGTTCTATTTCTGAACTGGAGCAGCTCTCCAATTCTCTACC 
AAATAAAGAATTGATGACCTCAATCTGTGACTGTCTGTTGGCTACGCTAGCTAACTCTGAGAGCAGTTACAACTGTTTACTGAC 
ATGTGTCAGAACAATGATGTTTCTTGCAGAGCATGATTATGGATTATTTCATTTAAAAAGTTCTTTAAGGAAAAACAGTAGTGC 
TCTGCATAGTTTACTGAAACGAGTGGTCAGCACATTTAGTAAGGACACAGGAGAGCTTGCATCTTCATTTTTAGAATTTATGAG 

ACAAATTCTTAACTCTGACACAATTGGATGCTGTGGAGATGATAATGGTCTCATGGAAGTAGAGGGAGCTCATACATCACGGAC
GATGAGTATTAATGCTGCAGAGTTAAAACAGCTTCTACAAAGCAMGAAGAAAGTCCAGAAAATTTGTTCCTTGAACTAGAGAA 
GCTTGTTTTGGAACATTCAAAAGATGATGACAATCTGGATTCTTTGTTGGACAGTGTAGTTGGACTTAAGCAGATGCTGGAGTC 
ATCAGGTGACCCTTTACCTCTCAGTGACCAGGATGTAGAACCAGTACTTTCAGCTCCAGAATCTCTTCAGAATCTGTTTAACAA 
TAGGACTGCCTATGTGCTTGCTGATGTCATGGATGATCAGTTGAAATCTATGTGGTTCACTCCATTTCAGGCTGAAGAGATAGA 

TACAGATCTGGATTTGGTAAAGGTTGACTTAATTGAACTCTCTGAAAAATGCTGTAGTGACTTTGATTTGCACTCAGAATTAGA
GCGCTCATTTTTGTCAGAACCATCATCTCCAGGAAGAACCAAGACTACTAAAGGATTCAAACTTGGGAAGGACAAGCATGAGAC 
CTTTATAACGTCAAGTGGAAAATCTGAATACATTGAACCTGCCAAAAGAGCTCATGTTGTGCCACCACCAAGAGGAAGGGGCAG 
GGGAGGATTTGGACAGGGTATACGACCTCATGATATTTTTCGTCAGAGAAAACAGAACACAAGTAGACCACCATCTATGCATGT 
GGATGACTTTGTTGCTGCTGAAAGTAAAGAAGTGGTTCCTCAAGATGGAATACCTCCACCAAAACGGCCACTCAAAGTATCACA 

GMGGAACAAGAGGCTCCAGTTGGAGTGCTCAGAATACTCCTCGAGGAAATTACAATGAAAGTCGTGGAGGCCAGAGCAATTTT
AACAGAGGCCCTCTTCCACCATTACGACCCCTTAGTTCTACAGGTTACCGCCCAAGTCCTCGGGACCGTGCTTCTAGAGGTCGT 
GGGGGACTTGGACCTTCCTGGGCTAGTGCAAATAGCGGCAGTGGAGGCTCAAGAGGAAAGTTTGTTAGTGGAGGCAGTGGTAGA 
GGTCGTCATGTACGCTCCTTTACACGATAAAAATCCTTTTGGGAACATCTTAACTGTATATGAACATTTCACGAGGACAATAAA 
AATAAGACATTGAAGGACCAATTTAGACTTAGCAGTTATCTGGAGACATCTGAGAGAATATTTTTATCTGAAGAAAGCAGAATT 

TGTTTGATACCTAACAAGATTTCAATAAAAATCCAAACTTTGTATGT

39 

MAARDSDSEEDLVSYGTGLEPLEEGERPKKPIPLQDQTVRDEKGRYKRFHGAFSGGFSAGYFNTVGSKEGWTPSTFVSSRQNRA 
DKSVLGPEDFMDEEDLSEFGIAPKAIVTTDDFASKTKDRIREKARQLAAATAPIPGATLLDDLITPAKLSVGFELLRKMGWKEG 
QGVGPRVKRRPRRQKPDPGVKIYGCALPPGSSEGSEGEDDDYLPDNVTFAPKDVTPVDFTPKDNVHGLAYKGLDPHQALFGTSG 
EHFNLFSGGSERAGDLGEIGLNKGRKLGISGQAFGVGALEEEDDDIYATETLSKYDTVLKDEEPGDGLYGWTAPRQYKNQKESE
KDLRYVGKILDGFSLASKPLSSKKIYPPPELPRDYRPVHYFRPMVAATSENSHLLQVLSESAGKATPDPGTHSKHQLNASKRAE 
LLGETPIQGSATSVLEFLSQKDKERIKEMKQATDLKA^ 

PFAKDPEKQKRYDEFLVHMKQGQKDALERCLDPSMTEWERGRERDEFARAALLYASSHSTLSSRFTHAKEEDDSDQVEVPRDQE 
NDVGDKQSAVKMKMFGKLTRDTFEWHPDKLLCKRFNVPDPYPDSTLVGLPRVKRDKYSVFNFLTLPETASLPTTQASSEKVSQH 
RGPDKSRKPSRWDTSKHEKKEDSISEFLSLARSKAEPPKQQSSPLWKEEEHAPELSANQTWKDVDAQAEGEGSRPSMDLFRA
IFASSSDEKSSSSEDEQGDSEDDQAGSGEANFQSSQDTDLGETSSVAHALVPAPQEPPPSFPIQKMQIDEREEFGPRLPPVFCP 
NARQTLEVPQKEKHKKNKDKHKAKKEHRRKKEKKKKHRKHKHKGKQKNKKPEKSSSSESSDSSDSQSDEETADVSPQELLRRLK 
SLPLRRQ 

40 

GTGTTAGAATTTCTGTCCCAAAAAGACAAAGAGAGAATCAAAGAAATGAAGCAGGCAACTGACCTGAAAGCAGCTCAGCTCAAG
GCCAGGAGTCTGGCCCAGAACGCTCAGAGCAGCAGAGCCCAGCTCTCCCCTGCAGCGGCTGCTGGGTACTGCTCTTGGAACATG 
GCATTAGGTGGTGGGACGGCCACCTTAAAAGCCAGCAACTTCAAGCCTTTCGCCAAAGATCCGGAAMGCAAAAGCGATACGAC 
GAGTTCTTAGTACACATGAAACAGGGTCAGAAAGATGCTCTGGAACGCTGTCTGGACCCCAGCATGACGGAGTGGGAGCGAGGC 
CGTGAGCGGGATGAGTTTGCCCGGGCGGCCCTGCTGTACGCATCTTCCCATTCGACCTTGTCCTCCAGGTTCACTCACGCCAAG

GAGGAGGATGACTCAGATCAGGTTGAAGTCCCTCGAGACCAAGAGAATGATGTCGGGGATAAGCAGTCGGCTGTGAAGATGAAG
ATGTTTGGGAAGCTCACCCGAGACACGTTTGAGTGGCACCCTGACAAGCTTCTATGTAAGAGATTTAATGTCCCTGACCCTTAT 
CCAGATTCAACTTTAGTTGGCTTACCAAGAGTGAAGCGTGACAAGTACTCAGTCTTCAACTTTCTGACGCTCCCAGAGACAGCT 
TCCTTGCCCACCACTCAAGCATCAAGTGAAAAAGTATCACAGCACCGAGGTCCCGACAAATCAAGAAMCCATCCAGATGGGAT 
ACCTCTAAACACGAAAAGAAAGAAGATTCCATTAGTGAATTTTTAAGTTTGGCTAGATCAAAAGCCGAGCCACCTAAACAACAG 

TCCAGCCCCTTAGTAAACAAAGAGGAAGAGCATGCACCAGAATCATCCGCAAATCAGACAGTCAACAAAGATGTGGACGCACAG
GCTGAAGGAGAAGGGAGCCGCCCATCCATGGACTTATTCAGGGCCATCTTTGCCAGTTCCTCAGATGAAAAGTCCTCATCCTCC
GAGGATGAGCAAGGTGACAGTGAAGATGATCAGGCAGGCTCTGGGGAGGCCAACTTCCAAAGCTCCCAAGACACTGACTTGGGG 
GAAACATCATCTGTGGCTCACGCTCTTGTGCCAGCACCCCAGGAGCCGCCACCTTCCTTCCCGATACAAAAGATGCAGATAGAT 
GAAAGAGAAGAGTTCGGCCCGCGGCTGCCTCCCGTCTTCTGCCCCAATGCTCGTCAGACACTTGAGGTTCCTCAAAAAGAGAAA 
CATAAAAAGAACAMGACAAGCACAAGGCCMGAAAGAGCACAGGCGGMGAMGAGAAGAAAAAGAAACACAGGAAGCACAAA
CACAMGGCAAGCAAAAGAATAAAAAACCAGAGAAAAGTAGTAGCTCCGAGAGTTCCGACAGCAGCGACAGCCAGAGTGACGAG 
GAAACCGCAGACGTGTCGCCCCAGGAGCTGCTGAGACGGCTGAAAAGTCTTCCACTAAGAAGGCAGTAATTGAATGCTGCCCTG 
GCTCGTCCTAGAATCATTTCTCCTCCATGATGGAAGCCCAGTGATTGTTCAGTTAACGCATTGTACAGAGTGTATTTATATGTA 
AATTCCTGCTGTAAAATAATTTTTAAAACCTTGACATTTCAAAGACTGCCTTGAAAGGTCGCAGAAATTGATTTCTATTTTTAA 
TTTCAGTTAGTATCGGGGGAAAAAAATCCAGATTGAACAGTTTAATTAAAGTGGAATTTTTCT

41 

MAQVSIIMDYSEWDLSTDAGERARLLQSPCV^ 

GIALQMGMLVFIISGLVILAYCSQASNERTYQEWWAVCGKLTGVLCEVAIAVYTFGTCIAFLIIIGDQQDKIIAVMAKEPEGA
SGPWYTDRKFTISLTAFLFILPLSIPREIGFQKYASFLSWGTWWTAIVIIKYIWPDKEMTPGNILTRPASWMAVF 
FGFQCHVSSVPVFNSMQQPEVKTWGGWTAAMVIALAVYMGTGICGFLTFGMVDPDVLLSYPSEDM

PILHFCGRAWEGLWLRYQGVPVEEDVGRERRRRVLQTLWFLLTLLLALFIPDIGKVISVIGGLAACFIFVFPGLCLIQAKLS
EMEEVKPASWWVLVSYGVLLVTLGAFIFGQTTANAIFVDLLA 

42 

ATGTGATGCATGCTCACGTGTCTCCGGAGCCGGCTCGGGAAAGAATCCCCCAAGCTGCATTTCATGAGTAAGCGTGAGAGCCGC 
TCAGTTTCCTCCAGCTCTGCTGAAGCCAGCACAGAAGTAGCCCAAACTCTTCCCTCTGCTGACAGCAAATTTTAGGCAAAGTCT
TGAGAAAGAAGAAATTGGGTCCAGAAAGGGAAGTGAGGAGAATCAGATCCCAGACCTTTGGGGAGAAGGAGCAACCGCCTCTGG 
CACAGCCCATCAGGGAGAAAGAGCAGGTTGAGAAGAGTCCTAAGCTAACAGCCCCAAACAGGTGGGTGTTGCTCAGCTCCCTGA 
GGCATGTGGTTGTAAGGCAGAACCCACAGACCTTGCAGGAAGAAGGCTCTCGGGGCCATGGCCCAGGTCAGCATCAACAATGAC 
TACAGCGAGTGGGACTTGAGCACGGATGCCGGGGAGCGGGCTCGGCTGCTGCAGAGTCCCTGTGTGGACACAGCCCCCAAGAGT 
GAGTGGGAAGCCTCTCCTGGGGGTCTGGACAGAGGCACCACTTCCACACTTGGGGCCATCTTCATCGTCGTCAACGCGTGCCTG
GGTGCAGGGTTACTCAACTTCCCAGCAGCCTTCAGCACTGCGGGGGGCGTGGCAGCAGGCATCGCACTGCAGATGGGTATGCTG 
GTTTTCATCATCAGTGGCCTTGTCATCCTGGCCTACTGCTCCCAGGCCAGCAATGAGAGGACCTACCAGGAGGTGGTATGGGCT 
GTGTGTGGCAAGCTGACAGGTGTGCTATGTGAGGTGGCCATCGCTGTCTACACCTTTGGCACCTGCATTGCCTTCCTAATCATC 
ATTGGCGACCAGCAGGACAAGATTATAGCTGTGATGGCGAAAGAGCCGGAGGGGGCCAGCGGCCCTTGGTACACAGACCGCAAG 
TTCACCATCAGCCTCACTGCCTTCCTCTTCATCCTGCCCCTCTCCATCCCCAGGGAGATTGGTTTCCAGAAATATGCCAGCTTC
CTGAGCGTCGTGGGTACCTGGTACGTCACAGCCATCGTTATCATCAAGTACATCTGGCCAGATAAAGAGATGACCCCAGGGAAC 

ATCCTGACCAGGCCGGCTTCCTGGATGGCTGTGTTCAATGCCATGCCCACCATCTGCTTCGGATTTCAGTGCCACGTCAGCAGT
GTGCCCGTCTTCAACAGCATGCAGCAGCCTGAAGTGAAGACCTGGGGTGGAGTGGTGACAGCTGCCATGGTCATAGCGCTCGCT 
GTCTACATGGGGACAGGCATCTGTGGCTTCCTGACCTTTGGAGCTGCTGTGGATCCTGACGTGCTCCTGTCCTATCCCTCGGAG 

GACATGGCCGTGGCCGTTGCCCGAGCCTTCATCATCCTGAGCGTGCTCACCTCCTACCCTATCCTGCACTTCTGTGGGCGGGCG
GTGGTGGAAGGCCTGTGGCTGCGCTACCAGGGGGTGCCAGTGGAGGAGGACGTGGGGCGGGAGCGGCGGCGGCGAGTGCTGCAG 
ACGCTGGTCTGGTTCCTGCTCACCCTGCTGCTGGCGCTCTTCATCCCTGACATCGGCAAGGTGATCTCAGTCATTGGAGGCCTG 
GCCGCCTGCTTCATCTTCGTCTTCCCAGGGCTGTGCCTCATTCAAGCCAAACTCTCTGAGATGGAAGAGGTCAAACCAGCCAGC 
TGGTGGGTGCTGGTCAGCTACGGAGTCCTCTTGGTCACCCTGGGAGCCTTCATCTTCGGCCAGACCACAGCCAACGCCATCTTT 

GTGGATCTCTTGGCATAACCACTGCCTCCCAGGGAACACAAGGCCTTTGCCATTGGTCGCAGGAACCCATCTCTTAGAGCTATG
GGGCCATTCTTAGTCCACGATCATTCCAACTGGTGGGATGACATCCGGACATCCTCTTCCAGGGACTGGGGCAAACTCAGGCCC 
CACACCTCTGGACAGCTCAAATCCAGTCCCCTTCCTGCTCCCCAGTCCTGGCAGTGCCGTGGATGGCGGCAGGAAGTCTCACAT 
CATAGAGGACCCCTCCTCCTCTCCCAGTTCTCAACTTCTCCATGCCTGGAATCCACGGGTGAAGAGAGTCGGTAGATCTCATAA 

GAAAGAATCCAGTCTGACTTCCCTCTGGAGAATGACTATGGACAGAAGGCCACCATCCTCCACAGAGCACCCTGTCCTGAGTAG
GGGTTGTGCTCATTACCCCAGGCCAGTGGTAGCTTCCTCAGGAGCCTGGCCACTTCCAAAGGTAGCACTGAAGTCGTGCAAATA

CATAGTCAGGTAGATTCAGACCTTGTCCACACCTTCCTGGGCAACCCCCACCATGAACCTGTCAGCCTCTTTCCCATAGCTAAT 
AGACATTTCCCAGGCCTTGAGGGGCCCCACCCTGTCTCTTTCATCAAACCTGATGGTCCAGGCTGGGCATCCCTCTCCTCCTCC 
ATCCCCAGACATCACCAGGTCTAATGTTTACAAACGGTGCCAGCCCGGCTCTGMGCCAAGGGCCGTCCCGTGCCACGGTGCTG 
TGAGTATTCCTCCGTTAGCTTTCCCCATAAGGTTGGGAGTATCTGCTTTTGTGTCTGAGATGGGCCCCTCTTTTCAGAGGCCGC 
AGGGTGGGTGATGGAGAAGGCTGAGAACCTTTCAGACCCTCTGTGTGGGCTGGGCTGGTCAGAATCAGGGTGTACCTCCCCGAC
ACCTTCTTTTTCAGTGATGTTTTCTCTTCTCCCTGCCTTTCCTCTGCCTCCTCCCCTGCCAGCCCTAGCGTGACTACCCAGAGA
C 

43 

MAQVSIMDYSEWDLSTDAGERARLLQSPCVDTAPKSEWEASPGGLDRGTTSTLGAIFIVVNACLGAGLLNFPAAFSTAGGVAA 
GIALQMGMLVFIISGLVILAYCSQASNERTYQEVW^
SGPWYTDRKFTISLTAFLFILPLSIPREIGFQKYA 

FGFQCHVSSVPVFNSMQQPEVKTWGGWTAAWIALAWMGTGICGFLTFGAAVDPDVLLSYPSEDMAVAVARAFIILSVLTSY 
PILHFCGRAWEGLWLRYQGVPVEEDVGRERRRRVLQTLVWFLLTLLLALFIPDIGKVISVIGGLAACFIFVFPGLCLIQAKLS 
EMEEVKPASWm/LVSYGVLLVTLGAFIFGQTTANAIFVDLLA 

44

GAGGCCAAGAGCGCGGGCGGCGAGGCAAGATGGCGGCAACCAAGAGGAAACGGCGTGGAGGCTTTGCAGTTCAGGCGAAGAAGC 
CAAAAAGAAACGAAATAGATGCGGAGCCGCCAGCTAAGCGGCACGCCACAGCAGAGGAGGTGGAGGAAGAAGAGAGGGACCGGA 
TCCCAGGCCCCGTTTGCAAGGGAAAGTGGAAAAATAAGGAACGGATTCTCATCTTTTCTTCCAGAGGAATAAATTTTAGAACAA 
GACATTTAATGCAGGACTTGAGAATGTTGATGCCTCATTCTAAAGCAGATACTAAAATGGATCGTAAGGATAAGCTATTTGTGA 
TTAACGAGGTTTGTGAAATGAAGAACTGTAATAAATGCATCTATTTTGAAGCTAAGAAAAAACAGGATCTCTATATGTGGTAAG
AGAATGTATTAAGATTTTGGTTAAACTCATTTAAGTGGATTTGTTCTTTGTACCTTTTATGTTATAGACTCCTAGTGTTTCATG 
TTAAGGGTTGAGAATGAAGCAAACTTTTGATTTCACGAA 

TATTATTTATGGTGAATAGGTGGTAAGCATTGACTACATTTGCTAATTAGTGGAAAAAGCTTACTCCATATCTTTCAGGCTTTC 
AAATTCACCTCACGGACCATCTGCTAAATTCCTTGTTCAAAATAGTAAGTTGACTCAATAAATTTTTTTAGATGAGGAAATTAA 

AAGTAATCTTTTGAAATAGTTGTGCAAAAGTTACAACTATTTTTGTTTTTGTTGTAATTTTTCTAGTTCATACCCTCGCTGAAC
TGAAGATGACTGGAAACTGTTTGAAAGGTTCTCGGCCCCTTTTGTCTTTTGACCCTGTAAGTTTCTCATTCAGTGTATGAGGTC 
TAATTTTCTTATCTGGCATGGTTGTTTTTAGTAGATATAGTTGTGTTATTCAATTTAGTTTCACCCCCACCTTGGTTTTATGGA 
TTATCAATTCTTTCAGGTACATTTATAATGAGGTTATAGCCAAGTTATAGAAACAAATGAGCAAACAGTTTTAAACAAGGCAGT 
CTACTGTTAAATAATATGCTTACCTTATTTTTATAGGCTTTTGATGAATTACCACATTATGCTTTGTTAAAAGAACTCTTAATT 

CAGGTAAATATCTTTAAAATTAGCTATCCAAAATATACATGATATATCTTGGAATAAGGAACTAACATACACTTTTTTAACTAG
ATCTTTAGTACACCACGGTATCATCCCAAAAGCCAACCATTTGTGGACCACGTGTTTACTTTCACCATTTTGGATAATAGGATA 
TGGTTTCGGAACTTTCAGATCATAGAAGAAGATGCTGCTCTTGTAGAAATAGGACCTCGTTTTGTCTTAAATCTCATAAAGATT 
TTCCAGGGAAGTTTTGGAGGACCAACTTTATATGAAAATCCTCACTACCAGTCACCAAACATGGTAAGCGGTTGTGTCATTCAG 
TAGTCTTCAATGTACCAGAACCCTTTGGCCAAAATGATTCTGTGAAGTAATTGCTGTTTTATTACCTGTGAAACTGAATTTGGT 

TTTTGCTGCATAGAAACTTGGAAATTACTCCCTTTTTTCATGACACTGGCTGAAAGGATATATGGTTCATAACTGAAAGTCTTT
GCCTTATATAAAATGTTTATTTTTATTTCAAAAGCATCTAATCTTTTTTTTTTTTTTTTAGCATCGGCGTGTCATAAGATCCAT 
CACAGCTGCAAAATACAGAGAGAAACAGCAAGTGAAAGATGTGCAAAAACTGAGAAAGAAAGAGCCGAAGACTCTTCTTCCACA 
TGATCCCACTGCAGATGTTTTTGTAACACCAGCTGAGGAGAAACCAATAGAAATACAGTGGGTAAAACCAGAGCCAAAAGTTGA 
TTTGAMGCAAGAMGAAACGGATTTACAAAAGGCAAAGAAAAATGAMCA 

TGGAAACCTGATTTGTTTTTCAGTTACTT^

GTCTAATTAGTGTAGCATTTACAAGAAAGAAAAATTO 
AATTCATCAGTT 

45 

MDWKEVLRRRLATPNTCPNKKKSEQELKDEEMDLFTKYYSEWKGGRKNTNEFYKTIPRFYYRLPAEDEVLLQKLREESRAVFLQ 
RKSRELLDNEELQNLWFLLDKRQTPPMIGEEAMM^

WLHQTRIGLSLYDVAGQGYLRESDLENYILELIPTLPQLDGLEKSFYSFYVCTAVRKFFFFLDPLRTGKIKIQDILACSFLDDL 

LELIO)EELSKESQETNWFSAPSALRVYGQYLNLDKDHNGM 

LENRREPAALQYIFKLLDIENKGYLIWFSLM^ 

TTILIDLNGFWTYENREALVANDSENSADLDDT

46

AAAAATCCGGCTTGGTGAGCTTGGGTCGCCTCTGAAGGAGAACCATTTTCCATCTCTTTCATAGTTTTTTCCCCCAGTCAGCGT 
GGTAGCGGTATTCTCCGCGGCAGTGACAGTAATTGTTTTTGTCTCTTTAGCCAAGACTTCCGCCCTCGATCAAGATGGTGGTTG 
GACGGCCTTCCTAACCTTTACGGGGCCTGGCGGTGCTGACGCCTGAGCTGGTAGGGGTGGAGCAGGTAGGAAACAGCAAATGCA 
GAAGCTGCTGCGCGGGAGTCGGCCATGGACTGGAAAGAAGTTCTTCGTCGGCGCCTAGCGACGCCCAACACCTGTCCAAACAAA
AAAAAAAGTGAACAAGAATTAAAAGATGMGAAATGGATTTATTTACAAMTATTACTCCGAATGGAAAGGAGGTAGAA 
ACAAATGAATTCTATAAGACCATTCCCCGGTTTTATTATAGGCTGCCTGCTGAAGATGAAGTCTTACTACAGAAATTAAGAGAG 
GAATCMGAGCTGTCTTTCTACAMGAAAAAGCAGAGAACTGTTAGATAATGAAGAATTACAGAACTTATGGTTTTTGCTGGAC 
AAACGCCAGACACCACCTATGATTGGAGAGGAAGCGATGATCAATTACGAAAACTTTTTGAAGGTTGGTGAAAAGGCTGGAGCA
AAGTGCAAGCAATTTTTCACAGCAAAAGTCTTTGCTAAACTCCTTCATACAGATTCATATGGAAGAATTTCCATCATGCAGTTC 
TTTAATTATGTCATGAGAAAAGTTTGGCTTCATCAAACAAGAATAGGACTCAGTTTATATGATGTCGCTGGGCAGGGGTACCTT 
CGGGAATCTGATTTAGAAAACTACATATTGGAACTTATCCCTACGTTGCCACAATTAGATGGTCTGGAAAAATCTTTCTACTCC 
TTTTATGTTTGTACAGCAGTTAGGAAGTTCTTCTTCTTTTTAGATCCTTTAAGAACAGGAAAGATAAAAATTCAAGATATTTTA 

GCATGCAGCTTCCTAGATGATTTATTGGAGCTAAGGGATGAGGAACTGTCCAAGGAGAGTCAAGAAACAAATTGGTTTTCTGCT
CCTTCTGCCCTAAGAGTTTATGGCCAGTACTTGAATCTTGATAAAGATCACAATGGCATGCTCAGTAAAGAAGAACTCTCACGC 
CATGGAACAGCTACCATGACCAATGTCTTCTTAGACCGTGTTTTCCAGGAGTGTCTCACTTATGATGGAGAAATGGACTATAAG 
ACCTACTTGGACTTTGTCCTTGCATTAGAAAACAGAAAGGAACCTGCAGCTCTACAATATATTTTCAAACTGCTTGATATTGAG 
AACAAAGGATACCTGAATGTCTTTTCACTTAATTATTTCTTTAGGGCCATACAGGAACTAATGAAAATCCATGGACAAGATCCT 

GTTTCATTTCAAGATGTCAAGGATGAAATCTTTGACATGGTAAAACCAAAGGATCCTTTGAAAATCTCTCTTCAGGATTTAATC
AACAGTAATCAAGGAGACACAGTAACCACCATTCTAATCGATTTGAATGGCTTCTGGACTTACGAGAACAGAGAGGCTCTTGTT 
GCAAATGACAGTGAAAACTCTGCAGACCTTGATGATACATGATCTCTGAAAGACTAGACTGTCTTATATTATGAGATACTTGAA 
TGCTGCATGTAAAGCCTTTAAAGCAAAATCCTCAGAMTGGTCTAAATAAAACACTTGATATGCCTAGAGAACACAAAAAAAAA 
AAAAAAAAAAAAAAAAA 

47

MPKAKGKTRRQKFGYSWRKR^ 

KELVRKPWLNDLEAEASLPEKKGNTLSRDLIDWRYW 
SLQKRKMEVE 

48 

GTTACACGAGGTCTGAGAGACAGAGGCAGCGTGTTTGAGCTGCTGGTGCGGTGGTCAGCGCGATGCCCAAGGCCAAGGGCAAAA
CCCGGAGGCAGAAGTTTGGTTACAGTGTCAACCGAAAGCGTCTGAACCGGAATGCTCGACGGAAGGCAGCGCGCGGGATCGAAT
GCTCCCACATCCGACATGCCTGGGACCACGCTAAATCGGTACGGCAGAACCTGGCCGAGATGGGGTTGGCTGTGGACCCCAACA 
GGGCGGTGCCCCTCCGTAAGAGAAAGGTGAAGGCCATGGAGGTGGACATAGAGGAGAGGCCTAAAGAGCTTGTACGGAAGCCCT 
ATGTGCTGAATGACCTGGAGGCAGAAGCCAGCCTTCCAGAAAAGAAAGGAAATACTCTGTCTCGGGACCTCATTGACTATGTAC 

GCTACATGGTAGAGAACCACGGGGAGGACTATAAGGCCATGGCCCGTGATGAGAAGAATTACTATCAAGATACCCCAAAACAGA
TTCGGAGTAAGATCAACGTCTATAAACGCTTTTACCCAGCAGAGTGGCAAGACTTCCTCGATTCTTTGCAGAAGAGGAAGATGG 
AGGTGGAGTGACTGGTTTAGATCACAGCTGCCCCAGGCTGAGGCGTCCCCCGGACCAGTGAAGCTGGAGCCAGGGTGTAAGGCA 
AGGAGGTGCTGTGTGGCTCCAGAGGAGCTGGCCAGGTCCCATGGAATCAGAAGGTTACACACACACGTGCACACTCCCCGCTCT 
GGGGAAGGAACTGTTCTCAGAGGCTCCAATTTATATTCATCTGGGGGTTCACGGAAAAGCCAGAACCTGCTGTTTTCAGGGTGG 

GTGATGTAMTATAGTGTGTACATAATAAAGCAAAT

49 

MMPEFQKSSWIKNPTRVEEIICGLIKGGAAKLQII^ 

EVDPVLTVEEKYPYWEWYTKSHGLLVQQALPKAKLKEIVAESDVMLKEGYENFFDKLQQHSIPVFIFSAGIGDVLEEVIRQAG 
WHPWKWSNFMDFDETGVLKGFKGELIHVFM^ 
RVDELLEKYMDSYDIVLVQDESLEVANSILQKIL

50 

ATGATGCCAGAATTCCAGAAAAGTTCAGTTCGAATCAAGAACCCTACAAGAGTAGAAGAAATTATCTGTGGTCTTATCAAAGGA
GGAGCTGCCAAACTTCAGATAATAACGGACTTTGATATGACACTCAGTAGATTTTCATATAAAGGGAAAAGATGCCCAACATGT 
CATAATATCATTGACAACTGTAAGCTGGTTACAGATGAATGTAGAAAAAAGTTATTGCAACTAAAGGAAAAATACTACGCTATT 
GAAGTTGATCCTGTTCTTACTGTAGAAGAGAAGTACCCTTATATGGTGGAATGGTATACTAAATCACATGGTTTGCTTGTTCAG
CAAGCTTTACCAAAAGCTAAACTTAAAGAAATTGTGGCAGAATCTGACGTTATGCTCAAAGAAGGATATGAGAATTTCTTTGAT 
AAGCTCCAACAACATAGCATCCCCGTGTTCATATTTTCGGCTGGAATCGGCGATGTACTAGAGGAAGTTATTCGTCAAGCTGGT 
GTTTATCATCCCAATGTCAAAGTTGTGTCCAATTTTATGGATTTTGATGAAACTGGGGTGCTCAAAGGATTTAAAGGAGAACTA 
ATTCATGTATTTAACAAACATGATGGTGCCTTGAGGAATACAGAATATTTCAATCAACTAAAAGACAATAGTAACATAATTCTT
CTGGGAGACTCCCAAGGAGACTTAAGAATGGCAGATGGAGTGGCCAATGTTGAGCACATTCTGAAAATTGGATATCTAAATGAT 
AGAGTGGATGAGCTTTTAGAAAAGTACATGGACTCTTATGATATTGTTTTAGTACAAGATGAATCATTAGAAGTAGCCAACTCT 
ATTTTACAGAAGATTCTATAA

51

MHTLVFLSTRQVLQCQPAACQALPLLPRELFPLLFKVAFMDKKTWLiRELVHTWPFPLLSPQQLLQECAHCSRALLQERPSTES 
MQAVILGLTARLHTSEPGASTQPLCRKHALRVLDMTGLLDDGVEQDPGTMSMWDCTAAVARTCIAQQQGGAAEPGPAPIPVEVR 
VDLRVNRASYAFLREALRSSVGSPLRLCCRDLRAEDLPMRNTVALLQLLDAGCLRRVDLRFNNLGLRGLSVIIPHVARFQHLAS
LRLHYVHGDSRQPSVDGEDNFRYFLAQMGRFTCLRELSMGSSLLSGRLDQLLSTLQSPLESLELAFCALLPEDLRFLARSPHAA 
HLKKLDLSGNDLSGSQLAPFQGLLQASAATLLHLELTECQLADTQLLATLPILTQCASLRYLGLYGNPLSMAGLKELLRDSVAQ
AELRTWHPFPVDCYEGLPWPPPASVLLEASINEEKFARVEAELHQLLLASGRAHVLWTTDIYGRLAADYFSL 

52

CGGGGCTGGAGGCGGTGGCTGCGGTTGCGGGACCGGCACTATGCTGGGCCTTCCTACCACTTGTGTGTGGCTTGGTAGTGGCCT 
AGGGTCTCTCCTCCCTGCTGAAGTCCCTCTCCTGCAGGTGGCCGTCTGCCCGGCCCAGCACCATGCACACGCTTGTGTTCTTGA 

GCACACGGCAGGTGCTGCAGTGCCAGCCAGCTGCCTGCCAGGCCCTGCCCCTGCTGCCACGCGAACTCTTCCCCCTGCTGTTCA
AGGTGGCCTTCATGGACAAGAAGACAGTGGTACTGCGCGAGTTGGTACACACGTGGCCCTTCCCGCTGCTCAGTTTCCAGCAGC 
TGCTACAGGAGTGTGCCCACTGCAGCCGTGCCCTCCTGCAGGAGCGGCCTAGCACTGAGAGCATGCAGGCTGTTATCCTGGGGC 
TGACTGCCCGGCTCCACACCTCAGAGCCTGGGGCCAGCACACAGCCCCTCTGCAGGAAGCATGCGCTGCGGGTGCTGGACATGA 
CGGGCCTCTTGGATGATGGTGTGGAACAGGATCCTGGCACCATGAGCATGTGGGACTGTACTGCTGCCGTAGCTCGCACATGCA 

TTGCCCAGCAGCAGGGTGGGGCCGCAGAGCCTGGGCCAGCCCCCATCCCCGTGGAGGTGCGCGTGGACCTGCGGGTGAACCGGG
CCTCCTATGCGTTCCTGCGGGAGGCACTCCGAAGCAGCGTGGGCAGCCCGCTGCGGCTCTGCTGCCGGGACCTGCGAGCTGAGG 
ACCTGCCCATGCGCAACACTGTGGCCCTGCTGCAGCTTCTGGATGCAGGCTGCCTGCGCCGCGTGGACCTGCGCTTCAACAATC 
TGGGCCTGCGCGGCCTGTCTGTGATCATCCCACACGTGGCCCGCTTCCAGCACCTGGCCAGCCTGCGGCTCCACTATGTGCATG 
GGGATTCAAGGCAGCCCTCCGTGGATGGCGAGGACAACTTCCGCTACTTCCTTGCCCAGATGGGCCGCTTCACCTGTCTGCGTG 

AGCTCAGCATGGGCTCCTCTCTCCTTTCAGGGAGGCTGGACCAGCTGCTCAGCACCCTGCAGAGCCCCCTGGAGAGCCTGGAGT
TGGCCTTCTGTGCTCTGCTGCCTGAGGACCTACGCTTCCTGGCACGGAGCCCACATGCTGCCCACCTCAAGAAGTTGGACCTGA 
GTGGTAACGACCTGTCTGGCAGCCAGCTGGCACCCTTCCAGGGTCTGTTGCAGGCATCAGCAGCCACACTGTTGCATCTGGAGC 
TGACTGAGTGTCAGCTCGCAGACACCCAGCTGTTGGCCACACTACCCATCCTGACTCAGTGCGCCAGTCTCCGGTACCTTGGCC 
TCTATGGCAACCCACTGTCCATGGCGGGCCTCAAGGAGCTGCTGCGGGACTCAGTGGCACAGGCTGAGCTGCGTACTGTGGTGC 

ACCCCTTCCCTGTGGACTGCTATGAGGGCTTGCCCTGGCCGCCGCCTGCCTCTGTCCTGCTGGAGGCCTCCATCAATGAGGAGA
AGTTTGCCCGCGTAGAAGCTGAGTTGCACCAGCTGCTTCTAGCCTCAGGCCGTGCCCATGTGCTCTGGACCACGGACATCTACG 
GGCGACTGGCTGCGGACTACTTCAGCCTATGATGAAGTAGCTCTGGGTGAGACACAGGCCGCCCTGCAGTCTCTTTAGGTAGGC 
AGGGCCTTTGCTGGGACCCCTGGTGGAGGCCTTCACAAAAGCACTGGTTACTGGTTTCCTGCTGGGTCTACCTTGCTTCTGGGC
ACACCTCAAGCCTCCCCTGCTTTCTGCAGTGCCCCACGCGGTTTTCCCTGCACTTGCTCCATAATTGGCTGATCATCTGTGGGC 

CCCGGGGCTGGATGTCAGGCCTCCATTGCCCTGCTCAGTTTGGCTGCATTTGGCTGCCGTCTGGGGTCCTGGTCCTTTGTGCAA
ATGCTTTGGGATTCCAGTTGTGAGCTGAGAGAGATGATGGCCTCTCTGGGCCTTTCCTCTCCCTTTACTGAGAGCTCAGTGCTT 
CTGGGGTTGAAGTTGGACAGAGGCCTGCTTCAGGGAAGCTGGGAGTCCCCAGGCACTCACGCCTCTTACGTGTTCCCTACCCTG 
CCACCCAGCCACCCCACTGGGCTGGGCTCGGGCTGGAGGGGGTCATCAAGGTACACATGTGCCTGGAAGTTGAATTTGTGGCTG 
TTTTTTTTTCTGTCCACGTTGGTCACCCTTATCCTTATCTCTGCTGTCACCCCCAACATGGCCACCGGGCAACAACTGCCATCC 

AGCCTGTCGCCCCGCCCTTCGCGGGGCAGCCCCGTCGGCACTGCCGGCCAGTCCTTGCTCTTCCCACCTTTCGGAGGCCCAAGA
TCCTACTGTGGCCGGCCAGGGCCAGCGAGGGACCCCCCCCATGCAGAGCTGGAGGTTGGGGTGATGTCTTTTCGGAAGAGCTTC 
AAGGGAGGTGTTGGGGCCTCCCCGGCCACCTTCCATTGCTACCCCAGGATTCCCGAGTGCAACGTTCCCGGCTCGCGCCCCACA 
CACGGCTCAGCGCACACTGCGCGGCTTCCACCTTTACTGACGGAGCATGCGCGAGGCCGCACCGGCCAATCTCCGGCGCCCACG 
TCATCCGCGCGCCCGGGGCCCTAGCAGTGGATCTCGTAGGCGACCGGCGGGGGCACGCGGAGTCCCGGCCCCGCCCCCTGTTCC

GGGCCGCAGTCAGCGGGCGCCTCCGCCGGACCCTCGGCGAAGAGCGGCTTGGAGCGGTTGATGACGAACATCTCGTGGCCGGGC
TCGTCGCGGAGCTCCTCTAGCTGTGCGAACGTACAGGGGCCGTCCAAGTAGTCGTTGACGAACAGCGCTCCCTCCCCCGGAGGC 
CCCCGCGCCTTTTTTCGCCTGCGGCGCCGGCGACAGATCATGGCGACCAGGAGCAGCGCCGTGAGCGCCAGCAGCGCGATGGCC 
GCCGCAATGGCCGTCTGTGTGGCCACGCCCAGGGCGCGGAAGGCCATGCTGCCCGCCTCGGGCCGGGGCTCGCTGCCGGCGGGG 
CGGGCGGCCGGAGGCGGCGGTTGCGCGGGCTGCTGCGGCTGCTGCCGGGACGCGTTGACCAGGAGCCGGAAGGGCACGCGGGCA 

GCGCCGCCGGCGTTGGAGGCCTCGCACTCGTACTTACCGGCGTGCGCCAGCGTGATGTTGCTGAGGAAGAGCATGCCGCTGCCC
GTGTCGGATGCCGAGTGTCCGCCCAGGCCCAGCAACCCGCCTTCTAGCTGGGCCTGGGCTCGCGGCCGGCCCTCGCGAGGCTGG
GGCACCTTTCTCCAGGTCACCAATGGCTGCGGGTAGCCGGAGGCTTGGCAGGCAACCCGCAGGTCCTCACCCAGGTTGGCTGTG 
AGCTCCAGCGGCTGCACGTGGACAGAGGGCGGAATGCAGATGAGGCTGCTGTGGGATACGTCCAGGAGACTCTGGAGCGCCAGG 
CGCGGGGGCTCTGCACACATGATCTTCCTGTCCCTGGAGGTGAGCAGCCGCTGGCCGCCCTCCTTGATCCAGGCCCCCAGCCAG 
TGCAGGGCGCAGTCACAGCGCCATGGGTTCTCTGTGGGAGAGCAGCGTTAGGCAGGTGGCTTGAGGGTGCTGCTAAAACAGCCT
GTGCAGTTGGGGTTTTGCAGGCCAGGACAGAGGCCTCTTTCCCACCTCCCACAGCGTTTTCACACGGAGTCCAAGGCCCTGCCA 
CCCCTTCCTTGACCCCAAGCTCCTTGGGGCGGCAGGCCCTTCACCCTCGCCCCCCTCCCCTTTAGCTCTGTGATGCCTGCCTGT 
TACAGATCACTTCTCCGTCGGTCTCTGAGAAAGCACCTGCTCCTTAAGTCTTCCTGCAACAAGTGCCACTGTTTTTAGGAACCT 
GGGCGTCCACATAGACATCTCACCAGCACTGAAACCTCACAAGTCCTCTCAGCTTTGCCTTTGGATGCCCTCTCTTGGGAATGT 

CCCCAGTCCTGGTCAGCTGTCTCTCTCCTTTGCAATTTTGTCTGCCTCCCCTCAGCCTAAAAGTGTGCAGAACCCTCAATTCTG
TTAAGTCACCCTGTGGAGTTCCTGTCTTCTGTTTTCCCCAGGCAGGGTGCCTGAGCTGTATCCCCCAGCACACCCACTCCCGCA 
GCCCTCCAGTGTGGCTGCAGGCGGTGGTGCAGCCTTCCAGACTGCTGCCCAGTTGCCTGATGTCAGAGCCCCTCCACACATGAG 
CCTGCTCCCTACTGCCAACACCGTGGCCCAGACAGAGACGCTTTCCGAGGAAGAGGTACCTGTGAGGCGCAGGACTTGCAGACT 
GGCCAGGGGCTGCAGGGCCTCTCGGCTGATGGTGCCCAGCTGGTTCCTGCTGAGGTCCAGCAGTGCTAGGGAGGACAGCCCCGC 

TAGAGCCTGGTCCTCCAGCAGCTCAATGCTGTTTTCTTGCAGGTGAAGCTCCTGCAGTCGCTGAAGTAAGGACAGCAGATCGTG
AGGAAAAAGGGCGCCGAGGTTGGGGGCATGTCTCTCTTCTTACCAAGCTAGACTGGGTTGCCTTTTCTAACTATTCCAGCCCTA 
CAGGGCGAGGGGCCATAATGGAGTATCCCGCCCCTTTAGACCCCAGGCGCTCACCGGCAGGTGCAAGAAGGTGAAATCCAGCAG 
CCGCGCCAGCTGGTTGCCCGCCAGGTAGAGCACGCGCAGCTGGGCCAGGCCTACGAAGGCGCCGCTGCGCAAGCCGCGCAGCCG 
GTTGCTAGTGAGCGCCAGCTCCAGCAGGCGCGGCTGCGCGCGGAAGGCGCCGGCCTCCAGGGCGCGCAGGCTGTTGTTGTGCAG 

GTAGAGCCGGCGCAGAGCGGCGAGTGGCGCCAGGGCTCCCGGCTCTAGGCGGGCGATGTTGTTGTCCTGCAGGAACAGTGTCTG
CAGGCCGGGGAAAGAGGAGGCGCTTACCCCGCTGCGGGGGTCTTCCTCTCCCTGGGGGCACCCCGTCCTCCCGCAGCTCCACAC 
GGTGCCCACCTGCGTCCCTGGCGGGATTCCCAGCGGGACGACGCGCAACCGCAGGGCGCCACACTCCACCGTGGCGCTGTAGCA 
GCGGCAGGCTGCTGGGCAGCCGGCGGCGCGGAGCGGCAGTAGTAGCAGCAGCAGCGGCAGCAGTTCGGGGGCCCTCAGGGCCAT 
CTCCCGAGGCCCGGTTCCTCACCGGCCCTTCCGCGGTTCAGCCGCAGACGCGTGCCCTCCTGAAACACAGGTTGGCAGGCCAGT 

CTCGGCAGTCGAGAGCCAGCCAATAGATGGAATGGAGGCCTGCACCTGCGTCTAACTTTTGACACTATAAATAGGTTCAAGAAA
CTAATAAAACGTTCTGGTTTTCTCCTTTGAC 

53 

MLQTSNYSLVLSLQFLLLSYDLFVN^ 
TAWFALSISLHVWVMNLRWKESNSF 

54

ATGCTCCAGACCAGTAACTACAGCCTGGTGCTCTCTCTGCAGTTCCTGCTGCTGTCCTATGACCTCTTTGTCAATTCCTTCTCA 
GAACTGCTCCAAAAGACTCCTGTCATCCAGCTTGTGCTCTTCATCATCCAGGATATTGCAGTCCTCTTCAACATCATCATCATT 
TTCCTCATGTTCTTCAACACCTTCGTCTTCCAGGCTGGCCTGGTCAACCTCCTATTCCATAAGTTCAAAGGGACCATCATCCTG 
ACAGCTGTGTACTTTGCCCTCAGCATCTCCCTTCATGTCTGGGTCATGAACTTACGCTGGAAAAACTCCAACAGCTTCATATGG 
ACAGATGGACTTCAAATGCTGTTTGTATTCCAGAGACTAGCAGCAGTGTTGTACTGCTACTTCTATAAACGGACAGCCGTAAGA
CTAGGCGATCCTCACTTCTACCAGGACTCTTTGTGGCTGCGCAAGGAGTTCATGCAAGTTCGAAGGTGA 

55 

MGGFQRGKYGTJ^EGRSEDNLSATPPALRIILVGKTGCGKSATGNSILGQPVFESKLRAQSVTRTCQVKTGTTOGRKVLVVDTP 
SIFESQADTQELYKNIGDCYLLSAPGPHVLLLVIQLGRFTAQDTVAIRKVKEVFGTGAMRHWILFTHKEDLGGQALDDYVANT 
DNCSLKDLWECERRYCAFNMGSVEEQRQQQAELLAVIERLGREREGSFHSNDLFLDAQLLQRTGAGACQEDYRQYQAKVEWQ
VEKHKQELRENESNWAYKALLRVKHLMLLHYEIFVFLLLCSILFFIIFLFIFHYI 

56 

GTTCCCAATCAGTTTCCAGCCAACACCAGGGTGTCCTAGTCCGCAGAGGTGTGGGGGACACACTCCATAATCTCTACTTTTCTT 
TTTGTGCAGCTGAGTCATGGAGCTTTCAGCCCCAGCACATGGCTCCTCCTTAACTGCGTCTGCTCAACCTCCCTCAGCCCTGTG 
AACAGCATCCCCGCACACAGACGCAGAGCAGGACTCTCTCTGCTGCCACTTCACCTTCCTGAGAGAGGACCAGCGGCCAGAGCC
TCAGTGACTGCCACCCTGGAGGACAGGGCACAACAACCGTTTCTGAAGAGAATGGGAGGATTCCAGAGGGGCAAATATGGAACT 
ATGGCTGAAGGTAGATCAGAAGATAACTTGTCTGCAACACCACCGGCATTGAGGATTATCCTAGTGGGCAAAACAGGCTGCGGG 
AAAAGTGCCACAGGGAACAGCATCCTTGGCCAGCCCGTGTTTGAGTCCAAGCTGAGGGCCCAGTCAGTGACCAGGACGTGCCAG 
GTGAAAACAGGAACATGGAACGGGAGGAAAGTCCTGGTGGTTGACACGCCCTCCATCTTTGAGTCACAGGCCGATACCCAAGAG
CTGTACAAGAACATCGGGGACTGCTACCTGCTCTCTGCCCCGGGGCCCCACGTCCTGCTTCTGGTGATCCAGCTGGGGCGTTTC 
ACTGCTCAGGACACAGTGGCCATCAGGAAGGTGAAAGAGGTCTTTGGGACAGGGGCCATGAGACATGTGGTCATCCTCTTCACC 
CACAAAGAGGACTTAGGGGGCCAGGCCCTGGATGACTATGTAGCAAACACGGACAACTGCAGCCTGGAAGACCTGGTGCGGGAG 
TGTGAGAGAAGGTACTGTGCCTTCAACAACTGGGGCTCTGTGGAGGAGCAGAGGCAGCAGCAGGCAGAGCTCCTGGCTGTGATT
GAGAGGCTGGGGAGGGAGCGAGAGGGCTCCTTCCACAGCAATGACCTCTTCTTGGATGCCCAGCTGCTCCAAAGAACTGGAGCT 
GGGGCCTGCCAGGAAGACTACAGGCAGTACCAGGCCAAAGTGGAATGGCAGGTGGAGAAGCACAAGCAAGAGCTGAGGGAGAAC 
GAGAGTAACTGGGCATACAAGGCGCTCCTCAGAGTCAAACACTTGATGCTTTTGCATTATGAGATTTTTGTTTTTCTATTGTTG 
TGCAGCATACTTTTTTTCATTATTTTTCTGTTCATCTTTCATTACATTTAAATCTCTGGACCCTGGAGCACTTCTAATGTATCA 

CCCCATGGAGTCATTGTTCTMTMTCACCAATTCAGACTCAGATGCTCGTGGTCTATGGAGCATGCTGCTTGCTGTCTGTGCA
GCTCCCATTTCCCCTTCTTCCTGATAGACTTGGAGCTGTGTGCCTCCACTCCAAGGCTGCCTGCCTGCTGTAAACACTATTCCA 
CTCTGTCTGCCAACAACTGCTTCAGGAATGGGCCTGAGATCCCATGCAGGTCCCTGAGAAGTGAGTAAAAGTCCGCAGAGGTGG 
GGATGGAAGATCTCTCCTTAGATAGAACCTGTCTTCCTCCCTGGCATTGTGGGGTCTGGGCGTGACACTGGGACTCTCAGCAGC 
TTTGTGCTGCCAACCTGAGATTGAAGGCAGTGCCTCAGAGCAGCACAGAGAGTTGGGGCCCCCTGAGCCCTGAGCCACCAGCCC 

TGCAGCCTGCCCTATCTCCGCATTTCCAGTTGTATTAGCCAATAGATTTCCTACTTATTTAAGCTATTTGAGCTCCGGGTCTCT
TCTACCTGCATTCTAAAACATTCAAAGTAATAAAAATTTCTCCACATT 

57

ELGIPALLDPNDMVSMSVPDCLSIMTYVSQYYNHFCSPGQAGVSPPRKGLAPCSPPSVAPTPVESEDVAQGEELSSGSLSEQGT 
GQTPSSTCAACQQHVHLVQRYLADGRLYHRHCFRCRRCSSTLLPGAYENGPEEGTFVCAEHCARLGPGTRSGTRPGPFSQPKQQ 

HQQQLAEDAKDVPGGGPSSSAPAGAEADGPKASPEARPQIPTKPRVPGKLQELASPPAGRPTPAPRKASESTTPAPPTPRPRSS
LQQENLVEQAGSSSLWGRLHELPVPKPRGTPKPSEGTPAPRKDPPWITLVQAEPKKKPAPLPPSSSPGPPSQDSRQVENGGTE 
EVAQPSPTASLESKPYNPFEEEEEDKEEEAPMPSLATSPALGHPESTPKSLHPWYGITPTSSPKTKKRPAPRAPSASPLALHA 
SRLSHSEPPSATPSPALSVESLSSESASQTAGAELLEPPAVPKSSSEPAVHAPGTPGNPVSLSTNSSLASSGELVEPRVEQMPQ 
ASPGLAPRTRGSSGPQPAKPCSGATPTPLLLiVGDRSPVPSPGSSSPQLQVKSSCKENPFNRKPSPAASPATKKATKGSKPVRPP 

APGHGFPLIKRKVQADQYIPEEDIHGEMDTIERRLDALEHRGVLLEEKLRGGLNEGREDDMLVDWFKLIHEKHLLVRRESELIY
VFKQQNLEQRQADVEYELRCLLNKPEKDWTEEDRAREKVLMQELVTLIEQRNAIINCLDEDRQREEEEDKMLEAMIKKKEFQRE 
AEPEGKKKGKFKTMKMLKLLGNKRDAKSKSPRDKS 

58

GGAGCTGGGGATCCCCGCTCTCCTGGACCCCAATGACATGGTCTCCATGAGCGTCCCTGACTGCCTCAGCATCATGACCTATGT 

GTCCCAGTATTACAACCACTTCTGCAGTCCTGGCCAAGCTGGTGTCTCGCCACCCAGAAAGGGCCTTGCACCCTGTTCCCCGCC
GTCTGTAGCACCCACTCCAGTGGAATCAGAAGATGTGGCTCAGGGCGAGGAGCTCTCCTCAGGCAGCCTGTCAGAGCAGGGCAC 
CGGCCAGACCCCCAGCAGCACGTGCGCAGCCTGCCAGCAGCATGTGCACTTGGTGCAGCGCTACCTGGCTGACGGCAGGCTGTA 
CCATCGCCACTGCTTCCGGTGTCGGCGGTGCTCCAGCACCCTGCTCCCTGGGGCTTATGAGAATGGGCCTGAGGAGGGCACCTT 
TGTGTGTGCAGAACACTGTGCCAGGCTGGGCCCGGGGACACGGTCGGGGACCAGGCCTGGGCCCTTCTCACAGCCAAAGCAGCA 

GCACCAGCAGCAACTCGCAGAAGATGCCAAGGATGTTCCAGGAGGCGGCCCCAGCTCCAGTGCTCCTGCAGGGGCTGAGGCCGA
TGGACCCAAGGCCAGCCCTGAGGCCCGGCCGCAGATCCCTACCAAGCCCCGGGTTCCTGGCAAACTACAGGAGCTGGCCAGCCC 
CCCTGCGGGCCGCCCCACCCCTGCCCCCAGGAAGGCCTCTGAGAGCACCACCCCAGCACCCCCCACGCCCCGGCCCCGCTCCAG 
TCTGCAGCAGGAGAACCTGGTGGAGCAGGCTGGCAGCAGCAGCCTGGTGAACGGGAGACTGCACGAACTGCCTGTCCCCAAGCC 
GAGGGGGACACCGAAGCCGTCCGAGGGGACACCAGCCCCCAGGAAGGACCCCCCATGGATCACGCTGGTGCAGGCAGAACCAAA 
GAAGAAGCCAGCCCCACTTCCCCCAAGCAGCAGCCCGGGGCCACCAAGCCAGGACAGCAGGCAGGTGGAGAATGGAGGCACCGA
GGAGGTGGCCCAGCCGAGCCCAACGGCCAGCCTGGAGTCCAAACCCTATAACCCCTTTGAGGAGGAGGAGGAGGACAAGGAGGA 
AGAGGCTCCAGCTGCACCCAGCCTGGCCACCAGCCCTGCCCTGGGCCACCCGGAGTCCACACCCAAGTCCCTGCACCCCTGGTA 
CGGCATCACCCCTACCAGCAGCCCCAAGACAAAGAAGCGCCCTGCCCCGCGCGCACCCAGCGCGTCCCCACTGGCTCTCCACGC 
CTCCCGCCTCTCGCACTCGGAGCCGCCCTCGGCCACACCATCGCCAGCGCTCAGCGTGGAGAGCCTGTCGTCTGAGAGCGCCAG 

CCAGACTGCAGGTGCAGAGCTTCTGGAGCCGCCAGCTGTGCCCAAGAGCTCCTCAGAGCCTGCTGTCCATGCCCCTGGTACCCC
TGGAAACCCTGTCAGCCTCTCTACCAACTCCTCCCTGGCCTCCTCTGGGGAACTAGTGGAGCCTAGAGTGGAACAAATGCCTCA 
AGCCAGCCCTGGCCTTGCCCCCAGGACCAGGGGCAGCTCAGGTCCCCAGCCAGCCAAGCCCTGCAGTGGCGCCACCCCAACGCC 
TCTCTTGTTGGTTGGAGACAGGAGCCCGGTGCCTTCCCCTGGAAGCTCGTCCCCACAGCTGCAGGTAAAGTCCTCCTGCAAGGA 
GAATCCTTTTAACCGGAAGCCATCACCTGCAGCGTCCCCAGCCACAAAGAAGGCCACCAAGGGATCCAAGCCAGTGAGGCCACC 

TGCCCCTGGACACGGCTTTCCACTCATCAAACGCAAGGTCCAGGCTGACCAGTACATCCCTGAGGAGGACATCCATGGAGAGAT
GGATACCATTGAGCGCCGGCTGGATGCCCTGGAGCACCGTGGGGTGCTGCTGGAGGAGAAGCTGCGTGGCGGCCTGAATGAGGG
CCGTGAGGATGACATGCTGGTGGACTGGTTCAAGCTCATCCACGAGAAGCACCTACTGGTGCGGCGAGAGTCCGAGCTCATCTA 
TGTCTTCMGCAGCAGAACCTGGAGCAGCGCCAGGCTGATGTCGAGTATGAGCTCCGGTGCCTCCTCAATAAGCCAGAAAAGGA 
CTGGACGGAGGAGGACCGGGCCCGGGAGAAGGTGCTGATGCAGGAGCTTGTGACCCTCATTGAGCAGCGCAACGCTATCATCAA 
CTGCCTGGATGAGGACCGGCAGAGGGAGGAAGAGGAAGACAAGATGTTGGAAGCCATGATCAAGAAGAAAGAGTTCCAGAGGGA
GGCTGAACCTGAGGGCAAGAAGAAGGGGAAGTTCAAGACCATGAAGATGTTGAAACTGGTAGGAAACAAACGTGATGCCAAGAG 
CAAGTCCCCCAGAGACAAGAGCTAACAGCACGAGAAGCCAGTTGGGGACTGCCCCCTCCTGGAGCAGCTCCTGGGCTGTGCTCT 
GTTTGAAGGGGGCGCCCTGCTCCCCTCAGATCAGTCAGGAGGAAGATGACTAAGGGGAGGGATCCTCTGGGTGATGGCCTCTTC 
CTCCTCAGGGACCTCTGACTGCTCTGGGCCAAAGAATCTCTTGTTTCTTCTCCGAGCCCCAGGCAGCGGTGATTCAGCCCTGCC 

CAACCTGATTCTGATGACTGCGGATGCTGTGACGGACCCAAGGGGCAAATAGGGTCCCAGGGTCCAGGGAGGGGCGCCTGCTGA
GCACTTCCGCCCCTCACCCTGCCCAGCCCCTGCCATGAGCTCTGGGCTGGGTCTCCGCCTCCAGGGTTCTGCTCTTCCAGGCAG 
GCCAGCAAGTGGCGCTGGGCCACACTGGCTTCTTCCTGCCCCATCCCTGGCTCTGAGTCTCTGTCTTCCTGTCCTGTGCAGGCG 
CCCTTGGATCTCAGTTTCCCTCACTCAGGAACTCTGTTTCTGAAGTCTTCAGTTAAGTTTGAGTTTATGACTGAGTGGCCTGTA 
CTGTCAGACGTGAATGGGCCTGACGGGCAAATCCATCCCTCTCTCCCTCACAGTTCCAGGAGCGGCTTCCCTCGTCTCCCCTTA

CTCCACAGGGAGCCTCCCTTGCCAGGACCAGGGCTGCGACGGCCATGCTGGGGCAGGTGAGTGCTCTGTTAGCTGCTCCCAGTG
CTGTCCCCAGGCTGCAGTTCTGGTCCCTGGTTGTCAGGTAGGAAGGGTGCACTTGAAGCAGGTGCTCATCTCGGTTCCTTAACG 
TTTATAGTCTGACCCCTCACTTAGGCTTTCCTCTGCCACCCCGGTCCAGGGAAGAGGCTCGCTCCCGCCCATGGTCATCACTGG 
TCTGTCTGCTCTGTTGTCTGTTCTTTCCCTGACTCCCTCCCACCGAAGGCCTGATGGCTACTCACCCCTCTGGGATGGCTATGG 
GAGAGGAGGAGTGATGGGGACCGCCACCTTTTCTGCAGGAAATGTGCCCAGCAGCTCTTGGTCAAAGCACTGTTGCTATAAGCT 

ATCTCTGGGATGCGTCTAGGCCCCCTTCCCTCTACACACCTCTGGGAAAAGATTACACTGTATTAACTCTCGAGGAGTTTCCTC
ACCAATAAACAGACAACCTCAACTGCCAGTGCCCTGCAGCCTCGGGCCACAGCGGCAGCCTTGTTTGCCTTCCCACCTGCCTCT 
GCCACACCTGGTGGCTGAACATCTCTGGTCGCCCAGAGGCCATGTTGGGGCCATCCTCCAAGAGGGATCTCTGCCCTCACCGCC 
TGCCACTGGGCAGGATCCCTTTCCTCTGCAGGGAGAGGTGGCTCCTCGGCCATGCAGCCCCTGGCAGGCTCCTTCTAAACATGC 
CTGTTGACCTGGAGCTGGCGCCACCAACTCCAGGGCCTTTCCAGGGCCAGACAGGTAACACGCATGAACCCGAGTGACAGCTCT 

GACGGGCTGTTTCGGTGTCAGGAGACAAAGCTGGCAGGGGCAGGGGTGAACTGGAGGCAAGTCAAGTCACCTGTGGCCTGTGGG
GCTGAATGTGGGCCCGGTGTTGCCAGATCCTTTGTCATAAGAAGCTAGAAATCCAGATTTTATGTGTGTGTAATTTGTAAATGC 
TGAAAGCTAGCCTGAATTTTTTTTTTTTTTTTGAGACAGAGTCTCGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCGCGATCTCA 
GCTCACTGCAGGCTCCGCCTCCTGGGTTCACGCCATCCTCCTGCCTCGGCCTCCTGAGCAGCTGGGACTACAGGCGCATGCTAC 
GACGCCTGGCTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCGTGTTAACCAGGATGGTCTCGATCTCCTGACCTTGTG 

ATCCACCCACCTTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCACGCCCGGCCACTAGCCTGAATTTCAATCAAGG
GTTGGCTGATACTGTGTGTCCAGGGTGGACTGGATTTGTCCTGGGGGGTTCTCTGGTTTGCTGCCTCCTGACCACATGATGGGG 
CCTTCGAGGTCGAGGACAACTGTTCCCATTAGATTGCACCCTCTGCCCTCAGGTTCTTGAGGGTGTGTGGACACAGAGGCTTTC 
CATGGGATGTGCCTGAGCCGGCCCTTGATTGGGGCCTCACCATTTACAGGGCCGTTTTATTCTGCAAACCGAAACTTGGGTCAT 
GTGACCTGATGGGATTATGGGACTCCCTCCAGGTGCCCGAGACAAGGTTGATATTTCCAAAATATTTTGGTGATTTAGTGGGAC 

AAGCAAATGACAGAATACCGGAGAAGGCAGGGATCGTGGGTGTCAGGAGCCAGAGGGGAGGGGGACAGATGTGCTGTGTACAGG
ACAAGGTGTCAGGTGACTCCTTCCCAGCAGGGCCTCGCAGATGCACAAGCACGGAGCTGGTGGGTTTTGCCCAAGAAAGGTCAC 
GCGGCACATGCAGGGATTGGAACTCCCAGGCCAGGGCTCTAGGTCGCTCCCACCTTTTCATGTTTCTTTCTGTGGCCATGGGTA 
TAGTGGAAAGACATAAAGCTAAAGCCAACTTTTAATCCTGAATGCACTGCTTGCCAGGTAAATGCCCTTGGTTGTGGTATCTTG 
TTGAGACTTAGTTTTCACAGAGGGATAATGAACCGTTGCAGAGGTTTATTGAGATCATTAACAGAGTGGAATTCAGCACCTGCC 

ACTGCACTCCAGCCTGGGCGACAGAGCAAGACTCAGTCTCAAAAAAAAAAAAAATTAGCTGGGCAGGGCATGGTGATGGTGCCT
GTAATCCTAGCTACTTGGGAGGCTGAGGCATGAGAATTGCCTGAACCCAGGAGGTGGAGGTTGCAGTGAGCCGAGATCGTGCCA 
CTGTACTCCAGCCTGGGTGACAGCGCGAGACTCCGTCTCAAAAAAAGCTGGGTGTGGGGAACACCTGTGGTCCCAGCTATTCTG 
GAGACTGAGGCAGGAGGATTGCTTGAGCTCAGGAGTTCTGGCTGCAGTGAGCTATGATCATGCCACTGTATTACAGAATGGGTG 
ACAGAATGAGAGCGACACTGTCTCAAAAAAAAAAAAAAAAAAAGGCCGGGAGCGGTCGTTTGTGCCTGTAATCCCAACACTTTG 

GGAGGCCAGGGTGGGCGGATCACTTGAGGCCAGGAGTTCAAGACCAGCCTGGGCAACATGGTGAATCCCCATCTCTACTAAAAA
AATTAACTGGACATGGTGGTGGACACTTGTAATCCCAGCTACTCAGGAGGCTGACACATGAGAATTGCTTGAACCCGGGAGGCG 
GAGGTTACAGTGAGCCGAGATAGCACCACTGCACTCCAACCTGGGCACAGAGTAAGGCTCTGTCTTT 

59 

MSEGPREPESEGIKLSADVKPFVPRFAGLWAWLESSEACVFPSSAATYYPFVQEPPVTEQKIYTEDMAFGASTFPPQYLSSE 
ITLHPYAYSPYTLDSTQNWSVPGSQYLYNQPSCYRGFQTVKHRNENTCPLPQEMKALFKKKTYDEKKTYDQQKF
SSEIKSARGSHHLSIYAENSLKSDGYHKRTDRKSRIIAKW 
SLLREWKPAAVLSKGEIWK^

VSIPSSEALSSDPSYNKEKHIIHPTQKSKASQGSDLEQNEASRKNKKKKEKSTSKYEVLTVQEPPRIEDAEEFPNLAVASERRD 
RIETPKFQSKQQPQDNFKNWKKSQLPVQL^^ 

HNPLDSSAPLMKKGKQREIPKAKKPTSLKKIILKERQERKQRLQENAVSPAFTSDDTQDGESGGDDQFPEQAELSGPEGMDELI 
STPSVEDKSEEPPGTELQRDTEASHLAPNHTTFPKIHSRRFRDYCSQMLSKEVDACVTDLLKELVRFQDRMYQKDPVKAKTKRR
LVLGLREVLKHLKLKKLKCVIISPNCEKIQSKGGLDDTLHTIIDYACEQNIPFVFALNRKALGRSLNKAVPVSWGIFSYDGAQ 
DQFHKMVELTVAARQAYKTMLEWQQELVGEPRPQAPPSLLTQGPSCPAEDGPPALKEKEEPHYIEIWKKHLEAYSGCTLELEE 
SLEASTSQMMNLNL 

60 

CCAAGCCGACGGCCCGCTGCTGGCCTCCGTGACGCGGCCTCCTCCGCGCCTCGCGGCATGGCGTCGGAGGGGCCGCGGGAGCCC
GAAAGCGAGGGCATCAAGTTATCAGCAGATGTCAAACCATTTGTCCCCAGATTTGCCGGGCTCAATGTGGCATGGTTAGAGTCC 
TCAGAAGCATGTGTCTTCCCCAGCTCTGCAGCCACATACTATCCGTTTGTTCAGGAACCACCAGTGACAGAGCAGAAAATATAT 
ACTGAAGACATGGCCTTTGGAGCTTCAACTTTTCCACCTCAGTATTTATCTTCTGAGATAACTCTTCATCCATATGCCTATTCT 
CCTTATACCCTTGACTCCACACAGAATGTTTACTCAGTGCCTGGCTCCCAGTATCTTTATAACCAACCCAGTTGTTACCGAGGT 

TTTCAAACAGTGAAGCATCGAAATGAGAACACATGCCCTCTCCCACAAGAAATGAAAGCTCTGTTTAAGAAGAAAACCTATGAT
GAGAAAAAAACGTATGATCAGCAAAAGTTTGACAGTGAMGGGCTGATGGAACTATATCATCTGAGATAAAATCAGCTAGAGGT 
TCACATCATTTGTCCATTTACGCTGAGAATAGTTTGAAATCAGATGGTTACCATAAGCGAACAGACAGGAAATCCAGAATCATT 
GCAAAAAATGTATCTACCTCCAAACCTGAGTTTGAATTTACCACACTGGACTTTCCTGAACTGCAAGGTGCAGAGAACAATATG 
TCAGAGATACAGAAGCAACCCAAGTGGGGACCTGTCCACTCTGTCTCTACCGACATTTCTCTTCTAAGAGAAGTAGTAAAACCA 

GCTGCAGTGTTATCAAAGGGTGAMTAGTGGTGAAAAATAACCCAAATGMTCTGTAACTGCTAATGCCGCTACCAATTCTCCT
TCATGTACAAGAGAGTTATCTTGGACACCAATGGGTTATGTTGTTCGACAGACATTATCTACAGAACTGTCAGCAGCCCCTAAA 
AATGTTACTTCTATGATAAACTTAAAGACCATTGCTTCATCAGCAGATCCTAAAAATGTTAGTATACCATCTTCTGAAGCTTTA 
TCTTCGGATCCTTCCTACAACAAAGAAAAACACATTATTCATCCTACCCAAAAGTCTAAAGCATCACAAGGTAGTGACCTTGAA 
CAAAATGAAGCCTCAAGAAAGAATAAGAAAAAGAAAGAAAAATCTACATCAAAATATGAAGTCCTGACAGTTCAAGAGCCTCCA 

AGGATTGAAGATGCCGAGGAATTTCCCAACCTGGCAGTTGCATCTGAAAGAAGAGACAGAATAGAGACACCGAAATTTCAATCT
AAGCAGCAGCCACAGGATAATTTTAAAAATAATGTAAAGAAGAGCCAGCTTCCAGTGCAGTTGGACTTGGGGGGCATGCTGACA 
GCCCTGGAGAAGAAGCAGCACTCTCAGCATGCAAAGCAGTCCTCCAAACCAGTGGTAGTCTCAGTTGGAGCAGTGCCAGTCCTT 
TCCAAAGAATGTGCATCAGGGGAGAGAGGCCGCCGCATGAGTCAAATGAAGACCCCGCACAATCCCTTGGACTCCAGCGCCCCA 
CTGATGAAGAAAGGGAAGCAGAGGGAGATCCCCAAGGCCAAGAAGCCAACCTCACTGAAGAAGATTATTTTGAAAGAACGGCAA 

GAGAGAAAGCAGCGTCTCCAAGAAAATGCTGTGAGTCCAGCTTTTACCAGTGATGACACACAAGATGGAGAGAGTGGTGGTGAT
GACCAGTTTCCCGAGCAGGCAGAGCTGTCAGGGCCAGAGGGGATGGACGAACTGATCTCCACTCCTTCGGTTGAGGACAAGTCT 
GAAGAGCCACCAGGCACAGAGCTCCAGAGGGACACAGAGGCCTCCCACCTTGCTCCCAATCACACCACCTTCCCTAAGATCCAC 
AGCCGCAGATTCAGGGATTACTGCAGCCAGATGCTTAGTAAAGAAGTGGATGCTTGTGTTACCGACCTACTCAAAGAACTGGTC 
CGTTTCCAAGACCGTATGTACCAGAAAGATCCAGTCAAGGCCAAGACTAAACGTCGACTTGTGTTGGGGTTGAGGGAGGTTCTC 

AAACACCTGAAGCTCAAAAAACTGAAATGTGTCATTATTTCTCCCAACTGTGAGAAGATACAGTCAAAAGGTGGGCTGGATGAC
ACTTTGCACACAATTATTGATTATGCCTGTGAGCAGAACATTCCCTTTGTGTTTGCTCTCAACCGCAAAGCTCTGGGGCGCAGT 
TTGAATAAGGCAGTTCCTGTCAGTGTGGTGGGGATCTTCAGCTATGATGGGGCCCAGGATCAGTTCCACAAGATGGTTGAGCTG 
ACAGTGGCGGCCCGACAGGCGTACAAGACCATGCTGGAGAATGTGCAGCAGGAGCTGGTGGGAGAGCCCAGGCCTCAGGCACCT 
CCCAGCCTACTCACACAGGGCCCCAGCTGCCCTGCAGAAGATGGCCCCCCAGCCCTGAAAGAAAAAGAAGAGCCACACTACATT 

GAAATCTGGAAAAAACATCTGGAAGCATACAGTGGATGTACCCTGGAGCTAGAAGAATCCTTGGAGGCTTCAACCTCTCAAATG
ATGAATTTGAATTTATGAGAGTTCTTGCCTGTGTGTCTGTATTTTGGGTAAGGAGGGGAGGTCTGAAAAAGACTTTGGGGCTTT 
TTCTTCTGTTTTTCATGACAATGTAATTTGTGTAACTGTTGAATCTGGAAATTGATCAGCATTAAAGGGCACATGAAGCAGTGT 
CTGCAGGCGTTCAGTGCTGCGGAGCCTGTTAAAGGTCACTCAGATGTGCAGGTGTTAATCTTCTCTAAAAGCCTGGTGATACAG 
CTCTGGCTTTCTGAGCACACTACGGATCTGGAAAATACTGGAAAATGTGATACTTAGAATACTTTGGCTGCTAAGGAAACTTCC 

TCTCCATTGCAGAATAGCTGAGCCAAGTGAGTGAGTTTGCAGAAAGCAGGTGGTGAGCTCCTGCCTGCTGGAGGTTGCCATGGA
GGGCCATTCCTGCCCGGCAACAGCACCGTCCTGCAGGGAGCCACTTGGCAGAAGGGTGCAGGGCTGCTGGTGTCAGAGCAAGAG 
GGCTACAGGGAAAGGGCCCTTTCTCAGGGGATGTAGCTTTTTTAAAAGATTTGGGAACACTTGGAGGATTTGCTAAAATGAGCC 
TCAGAAGGAAAATTGGTTTTCTAACCTGTGACTTTTTGAAATGAATTATTCCTTTCAGTCTTTATTTTTCAAAGAAACAATGTG 
TATTGAAGTACCTAGATTTGTTTGATAATCAACAAATCTTTCCTTTTTCAATGAACATATTCTGAATGTGGTTTCTGTCTTAGA 

CCAGGAGGACAGAGTTTGCTTTCATATTTTCCCTGTAAGTAAGAGGGCTTATTTATTTTAAATAAAGAGTAATTATTAAAAAAA
AAAAAAAAAAAAA 
CTCTTTGCTTTTGCATGGCTTGGCTTAGTATCCAAGGTATATTAGGGCCACTTGAAAGCATGAAGACCAGTTATATAGGGAACA
GGTTTCTCTCAGTGGCACATTTTGCTTTTTCTGAGCCCCAMTACATTGCCTGGGCATGAACATTGTTACCGTAMTTGCACAT 
GGTCATGGACTGMTTATGTGACTTTAAAGGATGTAACTGCCCAACATTTGCAGATTCTGGGTGGTCTATGTGACCATTTGTCT 
CGTATCCAAAAACCCCGGGGCTATTGGAACCCTTCCAACACTTTTTCCTTTGTCATAGACAAGTTTATATATAACTTACCAAGA
TGTTGGCTGTGCTGGTGTATTGCCAGACAGCTGTCTTTTGGTTCCCATTCCAAATGTGCTGCTGTCCTTCTTTGCATTTCACAA 
TATCAAAGAAACCACCACCCTTCTTCCTAACAGCATTTTATGCCTTTTATTCCACATTAAATGGGAATTGTGGCTACTTAGGAG 
TGCCCCTCCMTTMTTACATGTGTCCMGAATAATCCAAGCTAGAGACACMGGTGGGAAAACATTTCAAAAAAAAAAAGTCC 
TCTTAAGGCCAGTAATTTATCTGAAAAGGTATTTTATCACACCTTGACACCTTATATATGAGCCTATTAGGAGCTGCAGGTGGT 

TTCATAGGGTAAAATCCAAGAAAAGAGAAGGATGTGTGGGGTTTCTATTAGAAGATAATTTTGTTCTCATTTTACCTTTTCTTT
TATGATCCTTCTCTGCTAGAACAGGTTAATTCTCCAAATTTGTTTTGTTTTGTTTTGTTATTTTTTAGGGAACTCTTTTGCAAA 
AGCAATGGTCGGATGTAAATAACATTTAAAGTATAGTGCACATAACTTCCCCGGACTGTTCCAATCTGATAATTTGTAAATGCT 
TTAGAGTTTTTTTAATTAACACTTGTGTTGCTAAATTCTATTTATGTAAGTCTGCTAAAGTTTTTTAGCCCACTTAAAACTTAA 
GACAACCATTTAAAATAATGGATGGGTTACTATGAGCAATTTCGCTTTCAGAACCCCCTTGTTTTAGTATATGAAAAAGGCTAA 

TGCGCATTAATGAGGTTGAAGAGACTATGAGAAATATGTATAGTGTATATTTTAAAACAGCTTTGCTTGTATTGTGAAGATTTA
AAAACAAACTTGAGATTTTTAACGTAACTATTAACACAGTTTTAACATAAGTTATCCCACTGGGTTTAAGAGCATCTTGAATGT 
ATAATCCTTTTTGTAACCCAGGTTGGTTTCTACTTTTACCAGTCACCCAAACATATTTATGTTTTTAGTTTTATGTACTCATTT 
CCCTTTGTTTTCCTCAAACAGCATGATTTTTTTGCACATGTAGAAATTTTTTAAAAGAAAGAAATTAGTACATCATTTTCTCTG 
GATTTTCTTCACTTCCCTCTTCCTTTCTACTAACTCCTTCCTTAAAGGCCATATCACTCCATTTGCATTATTTGTGCAAATGCC 

AGGGTTGGTTTTTATTTTTATTTTTGCTATTTACCTAAAAAAAAAAAAAAAAA

63

MEVSRRKAPPRPPRPAAPLPLLAYLLALAAPGRGADEPVWRSEQAIGAIAASQEDGVFVASGSCLDQLDYSLEHSLSRLYRDQA 
GNCTEPVSLAPPARPRPGSSFSKLLLPYREGAAGLGGLLLTGWTFDRGACEVRPLGNLSRNSLRNGTEWSCHPQGSTAGVVYR 
AGRmRWYLAVAATWLPEPETASRCNPMSDHDTAIALKDTEGRSLATQELGRLKLCEGAGSLHFVDAFLMGSIYF 
YTSGAATGWPSMARIAQSTEVLFQGQASLDCGHGHPDGRRLLLSSSLVEALDVWAGVFSAAAGEGQERRSPTTTALCLFRMSEI
QARAKRVSWDFKTAESHCKEGDQPERVQPIASSTLIHSDLTSWGTVVMNRTVLFLGTGDGQLLKVILGENLTSNCPEVIYEIK 
EETPVFYKLVPDPVKNIYIYLTAGKEVRRIRVANCNKHKSCSECLTATDPHCGWCHSLQRCTFQGDCVHSENLENWLDISSGAK 
KCPKIQIIRSSKEKTTVTMVGSFSPRHSTC 

FTNCSSLKECPACVETGCAWCKSARRCIHPFTACDPSDYERNQEQCPVAVEKTSGGGRPKENKGNRTNQALQVFYIKSIEPQKV 
STLGKS3WIVTGMFTRASNITMILKGTSTCDKDVIQVSHVLNDTHMKFSLPSSRKEMKDVCIQFDGGNCSSVGSLSYIALPHC
SLIFPATTWISGGQNITMGRNFDVIOT^^ 

REDPRFTGYRVESEVDTELEVKIQKENDNFNISKKDIEITLFHGEMGQLNCSFENITRNQDLTTILCKIKGIKTASTIANSSKK 
VRVKLGNLELWEQESVPSTWYFLIVLPVLLVIVIFAAVGVTRHKSKELSRKQSQQLELLESELRKEIRDGFAELQMDKLDWD 
SFGTVPFLDYKHFALRTFFPESGGFTHIFTEDMHNRDANDKNESLTALDALICNKSFLVTVIHTLEKQKNFSVKDRCLFASFLT 
lALQTKLWLTSILEVLTRDLMEQCSMQPK^
TCKALYTLNEDWLLWQVPEFSTVALN^ 

QMGTRQKELLDIDSSSVILEDGITKLNTIGHYEISNGSTJKVFKKIANFTSDVEYSDDHCHLILPDSEAFQDVQGKRHRGKHKF 
KVKEMYLTKLLSTKVAIHSVLEKLFRSIW^ 

VFDIKKTPHIDGCLSVIAQAFMDAFSLTEQQLGKEAPTNKLLYAKDIPTYKEEVKSYYKAIRDLPPLSSSEMEEFLTQESKKHE 
NEFNEEVALTEIYKYIVKYFDEILNKLERERGLEEAQKQLLHVKVLFDEKKKCKWM

64 

GCGAGGAGGAAACGGTGCCGGAGCGCGCAGGGCTTGCTGCCGCCACCGCCGCTGCACAGGCTGGCGGAGCGAGCCTGCCGCGCG 
CCGCCCTCCCCGCTCTCCTTCCTGGGCGAGCTGCGGGGATGGGGCGGCCGCGGGAGCCCGAGCGCGCGCAGGAACCGCCGCCGC 
CGCCGCCCGCGTCTCCGTTGCCGCGCGCCTGAGCCGC^
GGCCGGAACAACCGCTGGTACCTGGCGGTGGCCGCCACCTACGTGCTGCCTGAGCCGGAGACGGCGAGCCGCTGCAACCCCGCG
GCATCCGACCACGACACGGCCATCGCGCTCAAGGACACGGAGGGGCGCAGCCTGGCCACGCAGGAGCTGGGGCGCCTCAAGCTG 
TGCGAGGGCGCGGGCAGCCTGCACTTCGTGGACGCCTTTCTCTGGAACGGCAGCATCTACTTCCCCTACTACCCCTACAACTAT 
ACGAGCGGCGCTGCCACCGGCTGGCCCAGCATGGCGCGCATCGCGCAGAGCACCGAGGTGCTGTTCCAGGGCCAGGCATCCCTC 
GACTGCGGCCACGGCCACCCCGACGGCCGCCGCCTGCTCCTCTCCTCCAGCCTAGTGGAGGCCCTGGACGTCTGGGCGGGAGTG
TTCAGCGCGGCCGCTGGAGAGGGCCAGGAGCGGCGCTCCCCCACCACCACGGCGCTCTGCCTCTTCAGAATGAGTGAGATCCAG 
GCGCGCGCCAAGAGGGTCAGCTGGGACTTCAAGACGGCCGAGAGCCACTGCAAAGAAGGGGATCAACCTGAAAGAGTCCAACCA 
ATCGCATCATCTACCTTGATCCATTCCGACCTGACATCCGTTTATGGCACCGTGGTAATGAACAGGACTGTTTTATTCTTGGGG 
ACTGGAGATGGCCAGTTACTTAAGGTTATTCTTGGTGAGAATTTGACTTCAAATTGTCCAGAGGTTATCTATGAAATTAAAGAA 

GAGACACCTGTTTTCTACAAACTCGTTCCTGATCCTGTGAAGAATATCTACATTTATCTAACAGCTGGGAAAGAGGTGAGGAGA
ATTCGTGTTGCAAACTGCAATAAACATAAATCCTGTTCGGAGTGTTTAACAGCCACAGACCCTCACTGCGGTTGGTGCCATTCG 
CTACAAAGGTGCACTTTTCAAGGAGATTGTGTACATTCAGAGAACTTAGAAAACTGGCTGGATATTTCGTCTGGAGCAAAAAAG 
TGCCCTAAAATTCAGATAATTCGAAGCAGTAAAGAAAAGACTACAGTGACTATGGTGGGAAGCTTCTCTCCAAGACACTCAAAG 
TGCATGGTGAAGAATGTGGACTCTAGCAGGGAGCTCTGCCAGAATAAAAGTCAGCCCAACCGGACCTGCACCTGTAGCATCCCA 

ACCAGAGCAACCTACAAAGATGTTTCAGTTGTCAACGTGATGTTCTCCTTCGGTTCTTGGAATTTATCAGACAGATTCAACTTT
ACCAACTGCTCATCATTAAAAGAATGCCCAGCATGCGTAGAAACTGGCTGCGCGTGGTGTAAAAGTGCAAGAAGGTGTATCCAC 
CCCTTCACAGCTTGCGACCCTTCTGATTATGAGAGAAACCAGGAACAGTGTCCAGTGGCTGTCGAGAAGACATCAGGAGGAGGA 
AGACCCAAGGAGAACAAGGGGAACAGAACCAACCAGGCTTTACAGGTCTTCTACATTAAGTCCATTGAGCCACAGAAAGTATCG 
ACATTAGGGAAAAGCAACGTGATAGTAACGGGAGCAAACTTTACCCGGGCATCGAACATCACAATGATCCTGAAAGGAACCAGT 

ACCTGTGATAAGGATGTGATACAGGTTAGCCATGTGCTAAATGACACCCACATGAAATTCTCTCTTCCATCAAGCCGGAAAGAA
ATGAAGGATGTGTGTATCCAGTTTGATGGTGGGAACTGCTCTTCTGTGGGATCCTTATCCTACATTGCTCTGCCACATTGTTCC 
CTTATATTTCCTGCTACCACCTGGATCAGTGGTGGTCAAAATATAACCATGATGGGCAGAAATTTTGATGTAATTGACAACTTA 
ATCATTTCACATGAATTAAAAGGAAACATAAATGTCTCTGAATATTGTGTGGCGACTTACTGCGGGTTTTTAGCCCCCAGTTTA 
AAGAGTTCAAAAGTGCGCACGAATGTCACTGTGAAGCTGAGAGTACAAGACACCTACTTGGATTGTGGAACCCTGCAGTATCGG 

GAGGACCCCAGATTCACGGGGTATCGGGTGGAATCCGAGGTGGACACAGAACTGGAAGTGAAAATTCAAAAAGAAAATGACAAC
TTCAATATTTCCAAAAAAGACATTGAAATTACTCTCTTCCATGGGGAAAATGGGCAATTAAATTGCAGTTTTGAAAATATTACT 
AGAAATCAAGATCTTACCACCATCCTTTGCAAAATTAAAGGCATCAAGACTGCAAGCACCATTGCCAACTCTTCTAAGAAAGTT 
CGGGTCAAGCTGGGAAACCTGGAGCTCTACGTCGAGCAGGAGTCAGTTCCTTCCACATGGTATTTTCTGATTGTGCTCCCTGTC 
TTGCTAGTGATTGTCATTTTTGCGGCCGTGGGGGTGACCAGGCACAAATCGAAGGAGCTGAGTCGCAAACAGAGTCAACAACTA 

GAATTGCTGGAAAGCGAGCTCCGGAAAGAGATACGTGACGGCTTTGCTGAGCTGCAGATGGATAAATTGGATGTGGTTGATAGT
TTTGGAACTGTTCCCTTCCTTGACTACAAACATTTTGCTCTGAGAACTTTCTTCCCTGAGTCAGGTGGCTTCACCCACATCTTC 
ACTGAAGATATGCATAACAGAGACGCCAACGACAAGAATGAAAGTCTCACAGCTTTGGATGCCCTAATCTGTAATAAAAGCTTT 
CTTGTTACTGTCATCCACACCCTTGAAAAGCAGAAGAACTTTTCTGTGAAGGACAGGTGTCTGTTTGCCTCCTTCCTAACCATT 
GCACTGCAAACCAAGCTGGTCTACCTGACCAGCATCCTAGAGGTGCTGACCAGGGACTTGATGGAACAGTGTAGTAACATGCAG 

CCGAAACTCATGCTGAGACGCACGGAGTCCGTCGTCGAAAAACTCCTCACAAACTGGATGTCCGTCTGCCTTTCTGGATTTCTC
CGGGAGACTGTCGGAGAGCCCTTCTATTTGCTGGTGACGACTCTGAACCAGAAAATTAACAAGGGTCCCGTGGATGTAATCACT 
TGCAAAGCCCTGTACACACTTAATGAAGACTGGCTGTTGTGGCAGGTTCCGGAATTCAGTACTGTGGCATTAAACGTCGTCTTT 
GAAAAAATCCCGGAAAACGAGAGTGCAGATGTCTGTCGGAATATTTCAGTCAATGTTCTCGACTGTGACACCATTGGCCAAGCC 
AAAGAAAAGATTTTCCAAGCATTCTTAAGCAAAAATGGCTCTCCTTATGGACTTCAGCTTAATGAAATTGGTCTTGAGCTTCAA 

ATGGGCACACGACAGAAAGAACTTCTGGACATCGACAGTTCCTCCGTGATTCTTGAAGATGGAATCACCAAGCTAAACACCATT
GGCCACTATGAGATATCAAATGGATCCACTATAAAAGTCTTTAAGAAGATAGCAAATTTTACTTCAGATGTGGAGTACTCGGAT 
GACCACTGCCATTTGATTTTACCAGATTCGGAAGCATTCCAAGATGTGCAAGGAAAGAGACATCGAGGGAAGCACAAGTTCAAA 
GTAAAAGAAATGTATCTGACAAAGCTGCTGTCGACCAAGGTGGCAATTCATTCTGTGCTTGAAAAACTTTTTAGAAGCATTTGG 
AGTTTACCCAACAGCAGAGCTCCATTTGCTATAAAATACTTTTTTGACTTTTTGGACGGCCAGGCTGAAAACAAAAAAATCACA 

GATCCTGACGTCGTACATATTTGGAAAACAAACAGCCTTCCTCTTCGCTTCTGGGTAAACATCCTGAAGAACCCTCAGTTTGTC
TTTGACATTAAGAAGACACGACATATAGACGGCTGTTTGTCAGTGATTGCCCAGGCATTCATGGATGCATTTTCTCTCACAGAG 
CAGCAACTAGGGAAGGAAGCACCAACTAATAAGCTTCTCTATGCCAAGGATATCCCAACCTACAAAGAAGAAGTAAAATCTTAT 
TACAAAGCAATCAGGGATTTGCCTCCATTGTCATCCTCAGAAATGGAAGAATTTTTAACTCAGGAATCTAAGAAACATGAAAAT 
GMTTTMTGAAGAAGTGGCCTTGACAGAAATTTACAAATACATCGTAAAATATTTTGATGAGATTCTAAATAAACTAGAAAGA 

GAACGAGGGCTGGAAGAAGCTCAGAAACAACTCTTGCATGTAAAAGTCTTATTTGATGAAAAGAAGAAATGCAAGTGGATGTAA
GCACTCTGGGGCCTGGCTTAATCTGGCAAAGTTCTTCAGACGACTTGGGAGCAAAATGGCTGCTTGAGCTACTCTGTGTCGTTA 
ATTTGTTGTTTGCACATAGGTTCCACTTTGGGCACTGTCTTTTTAAGAGACCAAGGCACATGCACAGCTTTTAGAAAGCAA
ACGCGTCCGAAGACATTAAGTAAAAAATTGGAACTATGATTTTTCTTTGTCATTTTTTAAAAAAGAATTATTTTATTAACCTGC
TGGCATATAATCTGGAGTTCTTTTCACAACCTTACTTTTTCTGATTTGCTTTATTGAATGATTGAATACTCATTTCTTTCTAAA 
AATATGTTGTAAATTCTCCCTTGGCAAGATTTCTCCCTATGAGGGTAGTTATTATTTGAGTCTGCCAAGTGGTTACCATGGGGC 
AAGGTGCCATGATGTATTCTTGGGTGCATTGGTTTTTTGCGCATTGTAAATTTAAGACACTTATAGTAAGTGGACTCATTCATA
GATGAGTTTCAGAACCTTTTACGTTCTCGGTAGAGGCTTCTGTCGGACAGGCAGAAGAGTGTATTCCTCACTTTTTTTTTTGTC
TTCAAATTCCAGTMGGCATAGCACCT^ 

TATCTTATGTTACACTGGGAGTAATTTGAGGTGCAATTATTTTTATTACTACTTTGAATAGAGGACCATTATCCTTCTTTCTTC 
AGAAMCTMGMGTAAGTGTAACTTTTAAAGTAAGT^ 
AGTTGTGTTTTCATGTCTCATCAAAAGACAATACCACATTGCATCATTTTACAAAATATGTTGTCATTTTCATTTCAGTTGTAA
CATAGGAAMTAGATATTTCCTAGATGATTTCTGAGTTTCTTACTGCAMGMCAGTTATAAATTGGTATACATGTGTCTCTGT 
AATAGGGATAATATTGATATATCTGTTGCTACATATTTAAGAATCATTCTATCTTATGTTGTCTTGAGGCCAAGATTTACCACG 
TTTGCCCAGTGTATTGAATTGGTGGra 

ATAGAATGTGGCTCAGTTCTGGTCCTTCAAGCCTGTATGGTTTGGATTTTCAGTAGGGGACAGTTGATGTGGAGTCAATCTCTT 
TGGTACACAGGAAGCTTTATAAAATTTCATTCACGAATCTCTTATTTTGGGAAGCTGTTTTGCATATGAGAAGAACACTGTTGA
AATMGGMCTAMGCTTTATATATTGATCAAGGTGATTCTGAAAGTTTTMTTTTTAATGTTGTAATGTTATGTTATTGTTM 
TTGTACTTTATTATGTATTCAATAGAAAATCATGATTTATTAATAAAAGC^TAAATTCTCATCTAAAAAAAAAAAAAA^ 

67 

MGRGWGFLFGLLGAWLLSSGHGEEQPPETAAQRCFCQVSGYLDDCTCDVETIDRFMYRLFPRLQKLLESDYFRYYKVNLKRP 
CPFMDISQCGRRDCAVKPCQSDEVPDGIKSASYKYSEEAMLIEECEQAERLGAVDESLSEETQKAVLQWTKHDDSSDNFCEA
DDIQSPEAEWDLLLNPERYTGYKGPDAWKIWNVIYEENCFKPQTIKRPLNPLASGQGTSEENTFYSWLEGLCVEKRAFYRLIS 
GLHASINVHLSARYLLQETWLEKKWGOT^ 

DEENKMLLLEILHEIKSFPLHFDENSFFAGDKKEAHKLKEDFRLHFRNISRIMDCVGCFKCRLWGKLQTQGLGTALKILFSEKL 
IANMPESGPSYEFHLTRQEIVSLFNAFGRISTSVKELENFRNLLQNIH 

68

GCACGAGCCCCGGGCTGCCGGCGCGGGCGCCGCGGCACGTCCACAGGCTGGGTCGCGAGGTGGCGATCGCTGAGAGGCAGGAGG 
GCCGAGGCGGGCCTGGGAGGCGGCCCGGAGGTGGGGCGCCGCTGGGGCCGGCCCGCACGGGCTTCATCTGAGGGCGCACGGCCC
GCGACCGAGCGTGCGGACTGGCCTCCCAAGCGTGGGGCGACAAGCTGCCGGAGCTGCAATGGGCCGCGGCTGGGGATTCTTGTT 
TGGCCTCCTGGGCGCCGTGTGGCTGCTCAGCTCGGGCCACGGAGAGGAGCAGCCCCCGGAGACAGCGGCACAGAGGTGCTTCTG 
CCAGGTTAGTGGTTACTTGGATGATTGTACCTGTGATGTTGAAACCATTGATAGATTTAATAACTACAGGCTTTTCCCAAGACT
ACAAAAACTTCTTGAAAGTGACTACTTTAGGTATTACAAGGTAAACCTGAAGAGGCCGTGTCCTTTCTGGAATGACATCAGCCA 
GTGTGGMGMGGGACTGTGCTGTCAMCCATGTCMTC^ 

TGAAGAAGCCAATAATCTCATTGAAGAATGTGAACAAGCTGAACGACTTGGAGCAGTGGATGAATCTCTGAGTGAGGAAACACA 
GAAGGCTGTTCTTCAGTGGACCAAGCATGATGATTCTTCAGATAACTTCTGTGAAGCTGATGACATTCAGTCCCCTGAAGCTGA 

ATATGTAGATTTGCTTCTTAATCCTGAGCGCTACACTGGTTACAAGGGACCAGATGCTTGGAAAATATGGAATGTCATCTACGA
AGAAAACTGTTTTAAGCCACAGACAATTAAAAGACCTTTAAATCCTTTGGCTTCTGGTCAAGGGACAAGTGAAGAGAACACTTT 
TTACAGTTGGCTAGAAGGTCTCTGTGTAGAAAAAAGAGCATTCTACAGACTTATATCTGGCCTACATGCAAGCATTAATGTGCA 
TTTGAGTGCAAGATATCTTTTACAAGAGACCTGGTTAGAAAAGAAATGGGGACACAACATTACAGAATTTCAACAGCGATTTGA 
TGGMTTTTGACTGMGGAGAAGGTCCAAGAAGGCTTM^ 

AGTGTTACCATTCTTCGAGCGCCCAGATTTTCAACTCTTTACTGGAAATAAAATTCAGGATGAGGAAAACAAAATGTTACTTCT
GGAMTACTTCATGAAATCAAGTCATTTCCTTTGCATTTTGATGAGAATTCATTTTTTGCTGGGGATAAAAAAGAAGCACACAA
ACTAAAGGAGGACTTTCGACTGCATTTTAGAAATATTTCAAGAATTATGGATTGTGTTGGTTGTTTTAAATGTCGTCTGTGGGG 
AMGCTTCAGACTCAGGGTTTGGGCACTGCTCTGAAGATCTTATTTTCTGAGAAATTGATAGCAAATATGCCAGAAAGTGGACC 
TAGTTATGAATTCCATCTAACCAGACAAGAAATAGTATCATTATTCAACGCATTTGGAAGAATTTCTACAAGTGTGAAAGAATT 

AGAAAACTTCAGGAACTTGTTACAGAATATTCATTAAAGAAAACAAGCTGATATGTGCCTGTTTCTGGACAATGGAGGCGAAAG
AGTGGMTTTCATTCAAAGGCATMTAGCMTGACAGTCT 

ATATTGTTTTAAGTAAACACATTTTTAAAAATTGTGTTAAGTCTATGTATAATACTACTGTGAGTAAAAGTAATACTTTAATAA 
TGTGGTACAAATTTTAAAGTTTMTATTGMTAAMGGAGGATTATCAMTTCATATATGATAAMGTGMTGTTCTAAGTCTC 
TCAAACTAGCGTTTTATGTAATMTATGTMTATAAATAAMCTATGGTAAATGTGACAAGCATTTAATAGGAAAATGCTAAGG 
AGGCCTCATAAATGACCCATAATTACCAACGTAGAATTTTTCAGTACATTTAGGGTTGCTGGATTTAGCAAATAAAAATAAAGA
TTGCCCAGTTAGATTTGAATTTCAGATAAACAATTAGTTTTTTAATATTTTACATGGAATATTTGGAAAATACTTATACTAAAA
AATTATTTGTTTGAAATTCACATTTAAGTGGGAGTCTTGTATTTTATCTGGCAATCCTAAAATACATTGGTATGAAACAAATCA 
CTTTTAGAAGTATATTGCTATTTTGATTGGGTTGTTTTTGTGTGTAGAAACGTACAATAACAACTGAAAGGCACAGGAGATTTC 
TAAACATTGTGAAAAGTTGAATAGATTATATATTTATTCTCATAATACTTTCACTAATACTAAATAAAATTTGGGGAACACTTT 
TTATTTTTATATMTTTCCMTTTACAGAAAAGTO^

ACGTGCTTTATCTTTCATGCTTTCTCTGTACACACACACACACACACAAATTTTTCCTCAATCATTTGAAAGTCAGTTATAGGC 
ATCATGCCCCTTAAACCCTAAATACTTCAGTGTGTAATACTC 

CAATTCTGGAACTATAATACTGTMGCCTTAGAATAMTMTACTTTCAAGTTCCAATCTAAAGTTCTTTTTGAGTTTTGTTGC 
CCGTTTTATGCTTGATGTGTATAGTAATAGGGTAGGCTATTTATTTTATTAAAATTTTTTTTAGAGACAAGGTTTTGCTGTGTT 

GCCCAAGCTGGAACTTGAACGACTGGGCTGAAGTGATCTTCCCACCTCAGCCTCCCAAGTAGCTGGGAATACAGGTGTCTGCCA
CCATACCCAGTTTCATTTTTGTTTTTTATACCCGAAGTTCATTTCCTTTGTCTCCCTAAAACTGAACTGTAATTTTGGGAGGTT 
TTCATTAGTGGAAGCTCTTCATTTATAAAGCTATTTGAAGGGGTTTAGGAATTTATATCACATGGTAATTGTAGAGAAAAAGAA 
GCTATATACCTCAAAATCGTGCCCTCTTTACATATGTCTTATCAGGTATAACATGTTGAAATGTCACATTAGTAGTAAAGTGGG 
GTTTATTTATATAGTGGTTAAGAAATGTCAGTTTACACTGCTGTATACTTCTTCTTCTGTGTCCCTAAGGCCTGGTACAGTGCC 

AAGCACATACTTGGTATCCAATAAATATTTGTTGGATGAAAAAAAAAAAAAAAAAAAA

70 

GGCACGAGGAGATGGCGGCAGCGGCGCTGGGGAGGGCGAGGCGGAGGCGGCAAAACGGGCGGTCGAGCAGAACGTGTAGCCGCG 
TCCCCTCCAGTCCGCTCCGGGCAGCTGCTGATGCAAGGAATCCCCTGGGCTCCCGTCCACTCCACTGCTGACCAGCCCATTCGC 
CTGTGCTGAGTCTTCCTGCAGGCCTTTCCTTGCCTCTGTGGGACCCTGTGGGGGTCCATCCGGCTGGAGAAGAAAAGCCTCTCA 

TGCTAACGTTGCAGACCCCAGAGGGTCCTGTGTGGGTGTGGAGATGGCCAATGAGAATCACGGCAGCCCCCGGGAGGAAGCGTC
CCTGCTGAGTCACTCCCCAGGTACCTCCAATCAGAGCGAGCCCTGTTCTCCAAAGCCAATCCGCCTGGTTCAGGACCTCCCAGA 
GGAGCTGGTGCATGCAGGCTGGGAGAAGTGCTGGAGCCGGAGGGAGAATCGTCCCTACTACTTCAACCGATTCACCAACCAGTC 
CCTGTGGGAGATGCCCGTGCTGGGGCAGCACGATGTGATTTCGGACCCTTTGGGGCTGAATGCGACCCCACTGCCCCAAGACTC 
AAGCTTGGTGGAAACTCCCCCGGCTGAGAACAAGCCCAGAAAGCGGCAGCTCTCGGAAGAGCAGCCAAGCGGCAATGGTGTGAA 

GAAGCCCAAGATTGAAATCCCAGTGACACCCACAGGCCAGTCGGTGCCCAGCTCCCCCAGTATCCCAGGAACCCCAACGCTGAA
GATGTGGGGTACGTCCCCTGAAGATAAACAGCAGGCAGCTCTCCTACGACCCACTGAGGTCTACTGGGACCTGGACATCCAGAC 
CAATGCTGTCATCAAGCACCGGGGGCCTTCAGAGGTGCTGCCCCCGCATCCCGAAGTGGAACTGCTCCGCTCTCAGGTCATCCT 
GAAGCTTCGGCAGCACTATCGGGAGCTGTGCCAGCAGCGAGAGGGCATTGAGCCTCCACGGGAGTCTTTCAACCGCTGGATGCT
GGAGCGCAAGGTGGTAGACAAAGGATCTGACCCCCTGTTGCCCAGCAACTGTGAACCAGTCGTGTCACCTTCCATGTTTCGTGA 

AATCATGAACGACATTCCTATCAGGTTATCCCGAATCAAGTTCCGGGAGGAAGCCAAGCGCCTGCTCTTTAAATATGCGGAGGC
CGCCAGGCGGCTCATCGAGTCCAGGAGTGCATCCCCTGACAGTAGGAAGGTGGTCAAATGGAATGTGGAAGACACCTTTAGCTG 
GCTTCGGAAGGACCACTCAGCCTCCAAGGAGGACTACATGGATCGCCTGGAGCATCTGCGGAGGCAGTGTGGCCCCCACGTCTC 
GGCCGCAGCCAAGGACTCCGTGGAAGGCATCTGCAGTAAGATCTACCACATCTCCCTGGAGTACGTCAAACGGATCCGAGAGAA 
GCACCTTGCCATCCTCAAGGAAAACAACATCTCAGAGGAGGTGGAGGCCCCTGAGGTGGAGCCCCGCCTAGTGTACTGCTACCC

AGTCCGGCTGGCTGTGTCTGCACCGCCCATGCCCAGCGTGGAGATGCACATGGAGAACAACGTGGTCTGCATCCGGTATAAGGG
AGAGATGGTCAAGGTCAGCCGCAACTACTTCAGCAAGCTGTGGCTCCTTTACCGCTACAGCTGCATTGATGACTCTGCCTTTGA 
GAGGTTCCTGCCCCGGGTCTGGTGTCTTCTCCGACGGTACCAGATGATGTTCGGCGTGGGCCTCTACGAGGGGACTGGCCTGCA 
GGGATCGCTGCCTGTGCATGTCTTTGAGGCCCTCCACCGACTCTTTGGCGTCAGCTTCGAGTGCTTCGCCTCACCCCTCAACTG 
CTACTTCCGCCAGTACTGTTCTGCCTTCCCCGACACAGACGGCTACTTTGGCTCCCGCGGGCCCTGCCTAGACTTTGCTCCACT 

GAGTGGTTCATTTGAGGCCAACCCTCCCTTCTGGGAGGAGCTCATGGATGCCATGGTCTCTCACTTTGAGAGACTGCTTGAGAG
CTCACCGGAGCCCCTGTCCTTCATCGTGTTCATCCCTGAGTGGCGGGAACCCCCAACACCAGCGCTCACCCGCATGGAGCAGAG 
CCGCTTCAAACGCCACCAGTTGATCCTGCCTGCCTTTGAGCATGAGTACCGCAGTGGCTCCCAGCACATCTGCAAGAAGGAGGA 
AATGCACTACAAGGCCGTCCACAACACGGCTGTGCTCTTCCTACAGAACGACCCTGGCTTTGCCAAGTGGGCGCCGACGCCTGA 
ACGGCTGCAGGAGCTGAGTGCTGCCTACCGGCAGTCAGGCCGCAGCCACAGCTCTGGTTCTTCCTCATCGTCCTCCTCGGAGGC 

CAAGGACCGGGACTCGGGCCGTGAGCAGGGTCCTAGCCGCGAGCCTCACCCCACTTAACATATCCTGCGGGGAGGAGGAGCCCC
AGGGGTGCTAGTCTGGACTGCTGGGACTCGGGCCCCTGGGGCCTCAGAGGGACCCCGGCTGCCACTGACATATGAAGATTATGG 
TTCTGCCAGGGCTCCCCTCCCTGCCTGTCCCCAAGTCCTCACCTCAAACTCCCTCCAAGTCCCATGTATATAGGTCCTGATGCC 
TTCCCMCCCCGCCCCTCACCCTGTTGCCACCTTGTTTCATTTGTAAAAGGAAATACAGAMCCCCCCCAAAAAAAAAAAAAAA 
AAAAAAAAAAAA
TATCATGCATGTGGGAAGGTGGGTGTGGTGAGAAAAGTTTTAAGGCAAGAGTAGATGGCCATGTTCAACTTTACAAMTTTCTT
GGAAAACTGGCAGTATTTTGAACTGCATCTTCTTTGGTACCGGAACCTGCAGAAACAGTGTGAGAAATTAAGTCCTGGTTCACT 
GCGCAGTAGCAAAGATGGTCAAGGCCATGGAAAMGCAGA 

TCAAATACATCTGCTATCTTTTTAAGCTAAGTCCTAGACATATCGGGGATAACATGGGGGTTGATTAGTGACCACAGTTATCAG 
AAGCAGAGAAATGTAATTCCATATTTTATTTGAAACTTATTCCATATTTTAATTGGATATTGAGTGATTGGGTTATCAAACACC
CACAMCTTTAATTTTGTTAMCT^ 

TATTTTACCATCTTTAATCATCTTGGAAAATACAAGTCCTGTGAACAACCACTCTTTCACCTAGCAGCATGAGGCCAAAAGTAA 
AGGCTTTAMTTATMCATATGGGATTCTTAGTAGTATGTTTTTTTCTTGAAACTCAGTGGCTCTATCTMCCTTACTATCTCC 
TCACTCTTTCTCTAAGACTAAACTCTAGGCTCTTAAAMTC^ 
AATATCTTTTAATAGTAACATGTATOT

CACACTGAGTTGGGAATTTCTTATCCCAGAAGACCAACCAATTTCATATTTATTTAAGATTGATTCCATACTCCGTTTTCAAGG 
AGMTCCC0XX1AGTCTCOT 

CTCTAAAAAAACAGAGAACAAATATGTCTCAGTTGTATTAAGCACGGACCCATATTATCATATTCACTTAAAAAAATGATTTCC 
TGTGCACCTTTTGGCAACTTCTCTTTTCAATGTAGGGAAAAACTTAGTCACCCTGAAAACCCACAAAATAAATAAAACTTGTAG 

ATGTGGGCAGAAGGTTTGGGGGTGGACATTGTATGTGTTTAAATTAAACCCTGTATCACTGAGAAGCTGTTGTATGGGTCAGAG
AAAATGAATGCTTAGAAGCTGTTCACATCTTCAAGAGCAGAAGCAAACCACATGTCTCAGCTATATTATTATTTATTTTTTATG 
CATAAAGTGAATCATTTCTTCTGTATTAATTTCCAMGGGTTTTACCCTCTATTTAAATGCTTTGAAAAACAGTGCATTGACAA 
TGGGTTGATATTTTTCTTTAAAAGAAAAATATAATTATGAAAGCCAAGATAATCTGAAGCCTGTTTTATTTTAAAACTTTTTAT 
GTTCTGTGGTTGATGTTGTTTGTTTGTTTGTTTCTATTTTGTTGGTTTTTTACTTTGTTTTTTGTTTTGTTTTGTTTTGTTTTG

CATACTACATGCAGTTCTTTAACCAATGTCTGTTTGGCTAATGTAATTAAAGTTGTTAATTTATATGAGTGCATTTCAACTATG
TCAATGGTTTCTTAATATTTATTGTGTAGAAGTACTGGTAATTTTTTTATTTACAATATGTTTAAAGAGATAACAGTTTGATAT 
GTTTTCATGTGTTTATAGCAGAAGTTATTTATTTCTATGGCATTCCAGCGGATATTTTGGTGTTTGCGAGGCATGCAGTCAATA 
TTTTGTACAGTTAGTGGACAGTATTCAGCAACGCCTGATAGCTTCTTTGGCCTTATGTTAAATAAAAAGACCTGTTTGGGATGT

73

MREIVHIQAGQCGNQIGAKFWEVISDEHGIDPSGNYVGDSDLQLERISVYYWEASSHKYVPRAILVDLEPGTMDSVRSGAFGHL
FRPDNFIFGQSGAGlSnMAKGHYTEGAELVDSV^ 

WPSPKVSDTWEPYNATLSIHQLVENTDETYCIDNEALYDICFRTLKLATPTYGDLNHLVSATMSGVTTSLRFPGQLNADLRK
LAVNMVPFPRLHFFMPGFAPLTRRGSQQYRALTVPEM^ 
NSSYFVEWIPMWKVAVCDIPPRGLKMSSTFIGN^^ 
EYQQYQDATAEEEGEMYEDDEEESEAQGPK

ATGCGGGAGATCGTGCACATCCAGGCCGGCCAGTGCGGCAACCAGATCGGGGCCAAGTTCTGGGAAGTCATCAGTGATGAGCAT 
GGCATCGACCCCAGCGGCAACTACGTGGGCGACTCGGACTTGCAGCTGGAGCGGATCAGCGTCTACTACAACGAGGCCTCTTCT 
CACAAGTACGTGCCTCGAGCCATTCTGGTGGACCTGGAACCCGGAACCATGGACAGTGTCCGCTCAGGGGCCTTTGGACATCTC

TTCAGGCCTGACAATTTCATCTTTGGTCAGAGTGGGGCCGGCAACAACTGGGCCAAGGGTCACTACACGGAGGGGGCGGAGCTG
GTGGATTCGGTCCTGGATGTGGTGCGGAAGGAGTGTGAAAACTGCGACTGCCTGCAGGGCTTCCAGCTGACCCACTCGCTGGGG 
GGGGGGACGGGCTCCGGCATGGGCACGTTGCTCATCAGCAAGGTGCGTGAGGAGTATCCCGACCGCATCATGAACACCTTCAGC 
GTCGTGCCCTCACCCAAGGTGTCAGACACGGTGGTGGAACCCTACAACGCCACGCTGTCCATCCACCAGCTGGTGGAAAACACG 
GATGAAACCTACTGCATCGACAACGAGGCGCTCTACGACATCTGCTTCCGCACCCTCAAGCTGGCCACGCCCACCTACGGGGAC 

CTCAACCACCTGGTATCGGCCACCATGAGCGGAGTCACCACCTCCTTGCGCTTCCCGGGCCAGCTCAACGCTGACCTGCGCAAG
CTGGCCGTCAACATGGTGCCCTTCCCGCGCCTGCACTTCTTCATGCCCGGCTTCGCCCCCCTCACCAGGCGGGGCAGCCAGCAG 
TACCGGGCCCTGACCGTGCCCGAGCTCACCCAGCAGATGTTCGATGCCAAGAACATGATGGCCGCCTGCGACCCGCGCCACGGC 
CGCTACCTGACGGTGGCCACCGTGTTCCGGGGCCGCATGTCCATGAAGGAGGTGGACGAGCAGATGCTGGCCATCCAGAGCAAG 
AACAGCAGCTACTTCGTGGAGTGGATCCCCAACAACGTGAAGGTGGCCGTGTGTGACATCCCGCCCCGCGGCCTCAAGATGTCC 

TCCACCTTCATCGGGAACAGCACGGCCATCCAGGAGCTGTTCAAGCGCATCTCCGAGCAGTTCACGGCCATGTTCCGGCGCAAG
GCCTTCCTGCAGTGGTACACGGGCGAGGGCATGGACGAGATGGAGTTCACCGAGGCCGAGAGCAACATGAACGACCTGGTGTCC
GAGTACCAGCAGTACCAGGACGCCACGGCCGAGGAAGAGGGCGAGATGTACGAAGACGACGAGGAGGAGTCGGAGGCCCAGGGC 
CCCAAGTGAAACTGCTCGCAGCTGGAGTGAGAGGCAGGTGGCGGCCGGGGCCGAAGCCAGCAGTGTCTAAACCCCCGGAGCCAT 
CTTGCTGCCGACACCCTGCTTTCCCCATCGCCCTAGGGCTCCCTTGCCGCCCTCCTGCAGTATTTATGGCCTCGTCCTCCCCCA
CCTAGGCCACGTGTGAGCTGCTCCTGTCTCTGTCTTATTGCAGCTCCAGGCCTGACGTTTTACGGTTTTGTTTTTTACTGGTTT
GTGTTTATATTTTCGGGGATACTTAATAAATCTATTGCTGTCAGATACCCTT 

75 

MEGETESPGPKKCGPYISSVTSQSWLMIRGWLFFIGVFLALVLNLLQIQR3WTLFPPDVIA 
GLLYPCIDRHLGEPHKFKREWSSVMRCVAVFVGINHASAKVDFDMIQLSLTLAALSIGLWWTFDRSRSGFGLGVGIAFLATW
TQLLVYNGVYQYTSPDFLYWSWLPCIFFAGGITMGNIGRQLAMYECKVIAENLIRNEEGKKYLLYRKAR 

76 

ACTCACTATAGGGCTCGAGCGGCCGCCCGGGCAGGTGGGATTTCGGGTAGGTCCTACTTTAGGACAAGATGTGGTACCGTTGAA 
GCGTCAGTCTTTGATTCACAGACAGTTGAGCTTTTCAGCTGGGAAGCCTTTCCATTTTTTTTTTTTTAACGGCTTTCTGAACCT

ATGAAACCATGGCAGAAGGAGAGACAGAGTCACCTGGGCCCAAAAAGTGTGGCCCATATATTTCATCTGTCACTAGCCAGAGTG
TGAACTTGATGATTCGAGGAGTAGTGCTATTTTTTATTGGAGTATTTCTTGCATTAGTGTTAAATTTACTTCAGATTCAGAGAA 
ATGTGACGCTCTTTCCACCTGATGTGATTGCAAGCATCTTTTCTTCTGCATGGTGGGTACCCCCATGCTGTGGCACGGCTTCAG 
CTGTGATTGGGTTATTATACCCCTGCATTGACAGACATCTAGGAGAACCACATAAATTTAAAAGAGAGTGGTCCAGTGTAATGC 
GGTGTGTAGCAGTCTTTGTTGGTATAAATCATGCCAGTGCTAAAGTGGATTTCGATAAGAACATACAGTTGTCTCTCACACTGG 

CTGCACTATCCATTGGACTGTGGTGGACTTTTGATAGATCTAGAAGTGGTTTTGGCCTTGGAGTAGGAATTGCCTTCTTGGCAA
CTGTGGTCACTCAACTGCTAGTATATAATGGTGTTTACCAATATACATCTCCAGATTTCCTCTATGTTCGTTCTTGGTTACCAT 
GTATATTTTTTGCTGGAGGCATAACAATGGGAAACATTGGTCGACAACTGGCAATGTACGAATGTAAAGTTATCGCAGAAAATC 
TCATCAGGAATGMGAAGGCAAAAAATATCTTTTGTACAGAAAAGCAAGATGAAAAGGATGTGAAATGGTAGATATACCAACAA 
AACTTCAGACTGTAAAATTGCCAGGATGCAGTTTTCCCCTTGAATGGCGTGTGTGTATATATGGATAAATATATATATACACAC 

ACACATATTACTGCAATCTGTGATTGCTTCATCTGTAAATCAGTTGTAAACCTTTACATATTTGACTTAAATAACTGTAAGATA
TATATGTACTACATTAAAAAGTGTTGATTAATAGATGAAATTTTTAAATTAATTTTTTAAAACATGCCATACATTGTATCACAA 
TGTTAATGTGCCAAGATATTGTTCCTGTCATGCAGAGTATAAGAATGCTTTGAACAATTTGTAGGACTTAGTGGAAATAAAATA 
AGAGGGAAAGGCCAAAAACAAACAAACAAAAGGCATATGGGGAGCTGGGTATTTTCTCTTTAGGCTTACTGTTGTGGCCTTTTT 
ATTTTTCCTAATCCACGCCGGTATGGGGTTTGGGGGGCCCCAATTGTTGG 

77

MSKLLRAVILGPPGSGKGTVCQRIAQNFGLQHLSSG^ 

WLLDGFPRTLGQAEALDKICEVDLVISLNIPFETLKDRLSRRWIHPPSGRVYNLDFNPPHVHGIDDVTGEPLVQQEDDKPEAVA 
ARLRQYKDVAKPVIELYKSRGVLHQFSGTETNKIWPYVYTLFSNKITPIQSKEAY 

78 

CGGCGCTGGGCTGAGGGGAGGGGTTGTCTTAAAAGTCTCTCCTTCCCCCTGTAGGGGCGGCCGGCGAGTCCCAGTGAGAGCGGA
GGGTGCCAGAGGTAGGGGGCCGAGAAACAAAGTTCCCGGGGCTTCCTCCGGGGCCGCGGTCGGGGCTGCGCGTTTGACCGCCCC 
CCTCCTCGCGAAGCAATGGCTTCCAAACTCCTGCGCGCGGTCATCCTCGGGCCGCCCGGCTCGGGCAAGGGCACCGTGTGCCAG 
AGGATCGCCCAGAACTTTGGTCTCCAGCATCTCTCCAGCGGCCACTTCTTGCGGGAGAACATCAAGGCCAGCACCGAAGTTGGT 
GAGATGGCAAAGCAGTATATAGAGAAAAGTCTTTTGGTTCCAGACCATGTGATCACACGCCTAATGATGTCCGAGTTGGAGAAC 

AGGCGTGGACAGCACTGGCTCCTTGATGGTTTTCCTAGGACATTAGGACAAGCCGAAGCCCTGGACAAAATCTGTGAAGTGGAT
CTAGTGATCAGTTTGAATATTCCATTTGAAACACTTAAAGATCGTCTCAGCCGCCGTTGGATTCACCCTCCTAGCGGAAGGGTA 
TATAACCTGGACTTCAATCCACCTCATGTACATGGTATTGATGACGTCACTGGTGAACCGTTAGTCCAGCAGGAGGATGATAAA 
CCCGAAGCAGTTGCTGCCAGGCTAAGACAGTACAAAGACGTGGCAAAGCCAGTCATTGAATTATACAAGAGCCGAGGAGTGCTC 
CACCAATTTTCCGGAACGGAGACGAACAAAATCTGGCCCTACGTTTACACACTTTTCTCAAACAAGATCACACCTATTCAGTCC 

AAAGAAGCATATTGACCCTGCCCAATGGAAGAACCAGGAAGATGTGGTCATTCATTCAATAGTGTGTGTAGTATTGGTGCTGTG
TCCAAATTAGMGCTAGCTGAGGTAGCTTGCAGCATCTTTTCTAGTTGAAATGGTGAACTGATAGGAAAACAAATGAGTAGAAA 
GAGTTCATGAAGAGGCCCTCCTCTGCCTTTCAAAAGGCTGGTCACCTACACATGTTTAAGGTGTCTCTGCACATGTCTCAAGCC 
CATCACAAGAAAGCAAGTACAGTGTGGATTTCAAATGGTGTGTAACTTCAGCTCCAGCTGGTTTTTGACAGCTGTTGCTGTGGT 
AATATTTTTGACATGTGATGGTGATAGTCTCTGGTTCTCCCCATCCCCACAAAGGCTGTTGAACCACAGCACCAGGAAGCCTGA 

GAATGAATCCTGAGGGCTCTAGCCCAGGCTTTGTCCCAGGCTTTCTGGTGTGTGCCCTCCTGGTAACAGTGAAATTGAAGCTAC
TTACTCATAGTGGTTGTTTCTCTGGTCTTGAGTGACTGTGTCCACAGTTCATTTTTTTCCGGTAGGAATAACTCCTTTTCTACA 
TCCACGCTCCATAGAGTCTCTCCTTTTCAGACATCCTGGGATGAAAGAATTTGGCTTTTTTTTTTCTTTTTTTTTTGGACATCT 
GTTTTCACTCTTAGGCTTTTAAACAATAGTTATTGCTTTTATCCCTCTCAGATTCTAATAACTGAGAGCGATGGGGCTATATTG
AATCTCTGTATGCACTGAGAACTGAGCTATGAAGAGAATCTTATTAAACTGCTGGTCTGACTTTATGGATTGACACTGTTCCTT
TCTTTTATTGTGAAAAAAAAAAAAAAA 

79 

MGSVAASAGVPRGCDILIVYSPDAEEWCQYLiQTLFLSSRQVRSQKILTHRLGPEASFSAEDLSLFLSTRCVWLLSAELVQHFH 
KPSLLPLLQRAFHPPHRWRLLC^^

KQQNLPTVTSPGNLMWQPDRIRCGAETTVWIVRCKLDDRVGTEAEFSPEDSPSVRMEAKVENEYTISVKAPHLSSGIWSLKI
YSGDLWCETVISYYTDMEEIGNLLSNAANPVEFMCQAFKIVPYNTETLDKLLTESLKMIPASGLHLFGINQLEEEDMMTNQR 
DEELPTLLHFAAKYGLKNLTALLLTCPGALQAYSVMKHGHYPNTIAEKHGFRDLRQFIDEYVETVDMLKSHIKEELMHGEEAD 
AWESMAHLSTDLLMKCSLNPGCDEDLYESiyLAAFVPMTEELYVEMLQASTSNPIPGDGFSRATKDSMIRKS
RDQCHLGQEEDVYHTVDDDEAFSVDLASRPPVPVPRPETTAPGAHQLPDNEPYIFKVFAEKSQERPGNFYVSSESIRKGPPVRP
WRDRPQSSIYDPFAGMKTPGQRQLITLQEQVKLGIVNVDEAVLHFKEWQLNQKRRSESFRFQQENLKRLRDSITRRQREKQKSG 
KQTDLEITVPIRHSQHLPAKVEFGVYESGPRKSVIPPRTELRRGDWKTDSTSSTASSTSNRSSTRSLLSVSSGMEGDNEDNEVP 
EVTRSRSPGPPQVDGTPTMSLERPPRVPPRAASQRPPTRETFHPPPPVPPRGR

80 

AGACCCCCCAGGGTGCCTCCGAGAGCTGCCTCACAGAGGACTCCGACCAGGGAGACCTTCCATCCTCCTCCACCTGTTCCACCC
AGAGGACGCTGATTCCACCTCCTAAAACCTGCCTACTTCAGGACTTTAAGACTCACAGTCTTCAGCCTGTTAATGATGTCTTCA 
TGTTGAGTTTTATAGCATGACTGTTGACCTTAAGATCCATTCTCATTGCTGATAATGCTGCAGCCCTGCTGGTTTGGGCTTGCT 
CGAAGATTTTATTAAGGCACGAAGAAGTGAAAAACTAAGGGCTTCATTCACCATCACCAAGTATATCGAACCATATACTTGTTT 
GCCAAAAGGGTGGAAGGACTTTAATTCGGAAATACTTTACCTCTTAATTTGGCCATATTCAGGAGGCCTAAAAGGATTGATCCN 

TAATGTTATTTCACCGTGGTTTTTATTGAATGG

82 

TTTGTGTGGTCTTTTTTTTTTTTTCTCAGTGTTTTCAATTCCTCCTTGGTTGAATCCATGGATGCAAAACCCACAGATATGAAG 
GGCTGGCTATATATGCATTGATGATTGTCCTATTATATTAGTTATAAAGTGTCATTTAATATGTAGTGAAAGTTATGGTACAGT 
GGAAAGAGTAGTTGAAAACATAAACATTTGGGCCTTTCAAGAAAGGTAGCTTGGTGAAGTTTTTCACCTTCAAACTATGTCCCA 
GTCAGGG

84 

GCTTGAGATCTCCAGCCTTACCGCGGCTCGAAATGGCCCCAACTGCTCCTGCACCACTGGTGTCTCCTGCANCTGCACCGGCTC 
CTGCAAGTGCAAAGAGTGCAAATGACCTCCTGCAAGAAGAGCTGCTGNNCTGCTGCCCCGTGGTGTGCAAGTGTGCCACGGTGT 
GTCTGCAMGGGACGTTGGAGAACTGCAGCTGCTGTNCTGATGTGGGAACAAGCTCTTCTCCCAGATGTTAATAGAACAAGCTG 
CACAACCTGGGATTTTTTTTCAATACGATACTTGAGCCATTTTCTGCATTTCTTTTTATTGTTAAATATGTGAGTGACAATAAA
ACAATTTTGACTTTGAATCTTAAAAA 

85 

MPLGHIMRLDLEKIALEYIVPCLHEVGFCYLDNFLGEWGDCVLERVKQLHCTGALRDGQLAGPRAGVSKRHLRGDQITWIGGN 
EEGCEAISFLLSLIDRLVLYCGSRLGKYWKERSKAW^ 
PEGKSFIADVEPIFDRLLFFWSDRRNPHEVQPSYATRYAMTVWYFDAEERAKAKKKFRNLTRKTESALTED-

86 

GAGTCTGGCCGCAGTCGCGGCAGTGGTGGCTTCCCATCCCCAAAAGGCGCCCTCCGACTCCTTGCGCCGCACTGCTCGCCGGGC 
CAGTCCGGAAACGGGTCGTGGAGCTCCGCACCACTCCCGCTGGTTCCCGAAGGCAGATCCCTTCTCCCGAGAGTTGCGAGAAAC 
TTTCCCTTGTCCCCGACGCTGCAGCGGCTCGGGTACCGTGGCAGCCGCAGGTTTCTGAACCCCGGGCCACGCTCCCCGCGCCTC 

GGCTTCGCGCTCGTGTAGATCGTTCCCTCTCTGGTTGCACGCTGGGGATCCCGGACCTCGATTCTGCGGGCGAGATGCCCCTGG
GACACATCATGAGGCTGGACCTGGAGAAAATTGCCGTGGAGTACATCGTGCCCTGTCTGCACGAGGTGGGCTTCTGCTACCTGG 
ACAACTTCCTGGGCGAGGTGGTGGGCGACTGCGTCCTGGAGCGCGTCAAGCAGCTGCACTGCACCGGGGCCCTGCGGGACGGCC 
AGCTGGCGGGGCCGCGCGCCGGCGTCTCCAAGCGACACCTGCGGGGCGACCAGATCACGTGGATCGGGGGCAACGAGGAGGGCT 
GCGAGGCCATCAGCTTCCTCCTGTCCCTCATCGACAGGCTGGTCCTCTACTGCGGGAGCCGGCTGGGCAAATACTACGTCAAGG 

AGAGGTCTAAGGCAATGGTGGCTTGCTATCCGGGAAATGGAACAGGTTATGTTCGCCACGTGGACAACCCCAACGGTGATGGTC
GCTGCATCACCTGCATCTACTATCTGAACAAGAATTGGGATGCCAAGCTACATGGTGGGATCCTGCGGATATTTCCAGAGGGGA
AATCATTCATAGCAGATGTGGAGCCCATTTTTGACAGACTCCTGTTCTTCTGGTCAGATCGTAGGAACCCACACGAAGTGCAGC
CCTCTTACGCAACCAGATATGCTATGACTGTCTGGTACTTTGATGCTGAAGAAAGGGCAGAAGCCAAAAAGAAATTCAGGAATT 
TAACTAGGAAAACTGAATCTGCCCTCACTGAAGACTGACCGTGCTCTGAAATCTGCTGGCCTTGTTCATTTTAGTAACGGTTCC 
TGAATTCTCTTAAATTCTTTGAGATCCAAAGATGGCCTCTTCAGTGACAACAATCTCCCTGCTACTTCTTGCATCCTTCACATC 
CCTGTCTTGTGTGTGGTACTTCATGTTTTCTTGCCAAGACTGTGTTGATCTTCAGATACTCTCTTTGCCAGATGAAGTTATTTG
CTAACTCCAGAAATTCCTGCAGACATCCTACTCGGCCAGCGGTTTACCTGATAGATTCGGTAATACTATCAAGAGAAGAGCCTA 
GGAGCACAGCGAGGGAATGAACCTTACTTGCACTTTATGTATACTTCCTGATTTGAAAGGAGGAGGTTTGAAAAGAAAAAAATG 
GAGGTGGTAGATGCCACAGAGAGGCATCACGGMGCCTTMCAGCAGGAAACAGAGAAATTTGTGTCATCTGAACAATTTCCAG 
ATGTTCTTAATCCAGGGCTGTTGGGGTTTCTGGAGAATTATCACAACCTAATGACATTAATACCTCTAGAAAGGGCTGCTGTCA
TAGTGAACAATTTATAAGTGTCCCATGGGGCAGACACTCCTTTTTTCCCAGTCCTGCAACCTGGATTTTCTGCCTCAGCTCCAT
TTTGCTGAAAATAATGACTTTCTGMTAAkG^ 
CCAGAAGCTAAAAMCTAGACATTAGTTGTTTTGGTTGCTOT 

CCTGGTAGTTTTGTGTTAACCACTGATAACTGTGGAAAGAGCTAGGTCTACTGATATACAATAAACATGTGTGCATCTTGAACA 
ATTTGAGAGGGGAGGTGGAGTTGGAAATGTGGGTGTTCCTGTTTTTTTTTTTTTTTTTTTTTTTTTTTAGTTTTCCTTTTTAAT 

gagctcaccctttaacacaaaaaaagcagggtgatgtatto

ttctcgtctgtccctagcagtctttcttcagctcacttggctctctagatccactgtggttggcagtatgaccagaatcatgga 
acttgctagaactgtggaagcttctactcctgcagtaagcacagatcgcactgcctcaataacttggtattgagcacgtatttt 
gcaaaagctacttttcctagttttcagtattactttcatgttttaaaaatccctttaatttcttgcttgaaaatcccatgaaca 
ttaaagagccagaaatattttcctttgto^ 

cttctggttttggaaaatttccagataagacatgtcaccattaattctcaacgactgctctattttgttgtacggtaatagtta
tcaccttctaaattactatgtaatttactcacttattatgtttattgtcttgtatcctttctctggagtgtaagcacaatgaag 
acaggaattttgtatatttttaaccaatgcmcatactctcagcacctaaaatagtgccgggaacatagtaagggctcagtaaa 
tacttgttgaatamctcagtctcctaca™^ 

87 

merrrlwgsiqsryismswtsprrlvelagqsllkdealai^

VTjMKGQHLHLETFKAVLDGLDVLLAQ 
EQPFIPVEVLVDLFLKEGACDELFSYLIEKVKR^ 

SPYLGQMINLRRLLLSHIHASSYISPEKEEQYIAQFTSQFLSLQCLQALYVDSLFFLRGRLDQLLRHVMNPLETLSITNCRLSE 
GDVMHLSQSPSVSQLSVLSLSGVMLTDVSPEPLQALLERASATLQDLVFDECGITDDQLLALLPSLSHCSQLTTLSFYGNSISI 
SALQSLLQHLIGLSNLTHVLYPVPLESYEDIHGTLHLERLAYLHARLRELLCELGRPSMVWLSMPCPHCGDRTFYDPEPILCP
CFMPN 

GCTTCAGGGTACAGCTCCCCCGCAGCCAGAAGCCGGGCCTGCAGCCCCTCAGCACCGCTCCGGGACACCCCACCCGCTTCCCAG 
GCGTGACCTGTCAACAGCAACTTCGCGGTGTGGTGAACTCTCTGAGGAAAAACCATTTTGATTATTACTCTCAGACGTGCGTGG 

CAACAAGTGACTGAGACCTAGAAATCCAAGCGTTGGAGGTCCTGAGGCCAGCCTAAGTCGCTTCAAMTGGAACGAAGGCGTTT
GTGGGGTTCCATTCAGAGCCGATACATCAGCATGAGTGTGTGGACAAGCCCACGGAGACTTGTGGAGCTGGCAGGGCAGAGCCT 
GCTGAAGGATGAGGCCCTGGCCATTGCCGCCCTGGAGTTGCTGCCCAGGGAGCTCTTCCCGCCACTCTTCATGGCAGCCTTTGA 
CGGGAGACACAGCCAGACCCTGAAGGCAATGGTGCAGGCCTGGCCCTTCACCTGCCTCCCTCTGGGAGTGCTGATGAAGGGACA 
ACATCTTCACCTGGAGACCTTCAAAGCTGTGCTTGATGGACTTGATGTGCTCCTTGCCCAGGAGGTTCGCCCCAGGAGGTGGAA 

ACTTCAAGTGCTGGATTTACGGAAGAACTCTCATCAGGACTTCTGGACTGTATGGTCTGGAAACAGGGCCAGTCTGTACTCATT
TCCAGAGCCAGAAGCAGCTCAGCCCATGACAAAGAAGCGAAAAGTAGATGGTTTGAGCACAGAGGCAGAGCAGCCCTTCATTCC 
AGTAGAGGTGCTCGTAGACCTGTTCCTCAAGGAAGGTGCCTGTGATGAATTGTTCTCCTACCTCATTGAGAAAGTGAAGCGAAA 
GAAAAATGTACTACGCCTGTGCTGTAAGAAGCTGAAGATTTTTGCAATGCCCATGCAGGATATCAAGATGATCCTGAAAATGGT 
GCAGCTGGACTCTATTGAAGATTTGGAAGTGACTTGTACCTGGAAGCTACCCACCTTGGCGAAATTTTCTCCTTACCTGGGCCA 

GATGATTAATCTGCGTAGACTCCTCCTCTCCCACATCCATGCATCTTCCTACATTTCCCCGGAGAAGGAAGAGCAGTATATCGC
CCAGTTCACCTCTCAGTTCCTCAGTCTGCAGTGCCTGCAGGCTCTCTATGTGGACTCTTTATTTTTCCTTAGAGGCCGCCTGGA 
TCAGTTGCTCAGGCACGTGATGAACCCCTTGGAAACCCTCTCAATAACTAACTGCCGGCTTTCGGAAGGGGATGTGATGCATCT 
GTCCCAGAGTCCCAGCGTCAGTCAGCTAAGTGTCCTGAGTCTAAGTGGGGTCATGCTGACCGATGTAAGTCCCGAGCCCCTCCA 
AGCTCTGCTGGAGAGAGCCTCTGCCACCCTCCAGGACCTGGTCTTTGATGAGTGTGGGATCACGGATGATCAGCTCCTTGCCCT 

CCTGCCTTCCCTGAGCCACTGCTCCCAGCTTACAACCTTAAGCTTCTACGGGAATTCCATCTCCATATCTGCCTTGCAGAGTCT
CCTGCAGGACCTCATCGGGCTGAGCAATCTGACCCACGTGCTGTATCCTGTCCCCCTGGAGAGTTATGAGGACATCCATGGTAC
CCTCCACCTGGAGAGGCTTGCCTATCTGCATGCCAGGCTCAGGGAGTTGCTGTGTGAGTTGGGGCGGCCCAGCATGGTCTGGCT 
TAGTGCCAACCCCTGTCCTCACTGTGGGGACAGAACCTTCTATGACCCGGAGCCCATCCTGTGCCCCTGTTTCATGCCTAACTA 
GCTGGGTGCACATATCAAATGCTTCATTCTGCATACTTGGACACTAAAGCCAGGATGTGCATGCATCTTGAAGCAACAAAGCAG 
CCACAGTTTCAGACAAATGTTCAGTGTGAGTGAGGAAAACATGTTCAGTGAGGAAAAAACATTCAGACAAATGTTCAGTGAGGA
AAAAAAGGGGAAGTTGGGGATAGGCAGATGTTGACTTGAGGAGTTAATGTGATCTTTGGGGAGATACATCTTATAGAGTTAGAA 
ATAGAATCTGAATTTCTAAAGGGAGATTCTGGCTTGGGAAGTACATGTAGGAGTTAATCCCTGTGTAGACTGTTGTAAAGAAAC 
TGTTGAAAATAAAGAGAAGCAATGTGAAGCAAAAAAAAAAAAAAAAAA 

89 

MMDSGGPGGPSPSERDRQYCELCGKMENLLRCSRCRSSFYCCKEHQRQDWKKHKLVCQGSEGALGHGVGPHQHSGPAPPAAVP
PPRAGAREPRKAAARRDNASGDAAKGKVKAKPPADPAAAASPCRAAAGGQGSAVAAEAEPGKEEPPARSSLFQEKANLYPPSNT
PGDALSPGGGLRPNGQTKPLPALKLALEYIVPCMNKHGICWDDFLGKETGQQIGDEVRALHDTGKFTDGQLVSQKSDSSKDIR 
GDKITWIEGKEPGCETIGLLMSSMDDLIRHCNGK^ 

KVSGGILRIFPEGKAQFADIEPKFDRLLFFWSDRRNPHEVQPAYATRYAITVWYFDADERARAKWYLTGEKGW 
VGKDVF

90 

TTAGGGGCAGAAAAACATTTGTAATAATTAATGGCTTTGAGAGACACAAGGCTTTGTTTGCCCCAGAGTATTAGTTAACCCACC 
TAGTGCTCCTAATCATACAATATTAAGGATTGGGAGGGACATTCATTGCCTCACTCTCTATTTGTTTCACCTTCTGTAAAATTG 
GTAGAATAATAGTACCCACTTCATAGCATTGTATGATGATTAAATTGGTTAATATTTTTAAAATGCTTAGAACACAGATTGGG^ 
ACATMCAGCAAGCACCACATGTGTTTATAAGATAMTTCCTTTGTGTTGCCTTCCGTTAMGTTTAMTAAGTAAATAMT^
ATAAATACTTGCATGACATTTTGAAGTCTCTCTATAACATCTGAGTAAGTGGCGGCTGCGACAATGCTACTGGAGTTCCAGAAT 
CGTGTTGGTGACAAGATTGTTCACCAGCATATGGTGTGGTGAAAACTCACTAATTTGGAATTAGTTCAGATTATTAAGCCTGAA 
TAGGTGAAMTCCTGAMTCAAGGATCTTTGGAACTATO 

GCAAGTGTTCTATTTGATGGATTAAGTATATTTAGGATATACATGTTCAATTTGTGATTTTGTATACTTAATTGGAACAAGAAA 
GCTAATAMGGTTTTGATATGGACATCTATTCTTTTMGTAMCTTCMTGAAAATATATGAGTAGAGCATATAGAGATGTAA^
TAATTTGTGGACACACCACAGACTGAAATAGCAMTTTAAAAGAAATTGTTGGAAGAATCAAGTGTTTGTGGAATGAGTCCTCC 
TAGTAAAGTTCCTGCTCTTGTGAATAATTMGCCTCATGTATAATTACTATAGCAAMGGAAGCCTAAGAAGTATTAGACTCTA 
CTTGTATTTAMTTACATTTTACATAATTTATGTGTATGAAAAATGTTTTAAATGCTTATTTTCGTAAGCCATGAGATAGCTCC 
TTTATATTTTAAGAATTTCTGAATTAATTTGCTTGGATTTTATTAGTGCAAATGGCAGAGCTAGCAATTCCTTTTTCTGTGTTC 
CCATTCCATCCTATTCATCCCTCTTTTAGGAAACTCTGAACTCTGGATTGTCCTTGTTTACATACCTGCCTCCTGCATTGGACT
ATGTGTCTCTGAGTGTAGTATGACTAATTCATT^ 

CTGCAACMTGACATTTTACTGTATTTAATAAAGCTCTGGGAAAGTAGGATACACATAAGACAGGTCTAGGTCTAAATTCTTTA 
CAGAAACTTGGATTTTTAGTTCGGTTTGAAATTTGMGATGTGAGTATATTTATCTCAGTTTCCCAMGGACAAGCTMTTGG^ 
ATTATCATCCTCTTTCACTTGATTGGATCCCCAGAATGCCATTTACGCATGCAGCAGGATTTTATAACAGTTTTAAATTCTGTA 

TATTTGATGAAGAGGTTTTATATTTTTGGATTCAAGCCTCTTTTTAAACTTCTACAATATGGTTTACAATAATTCCTTATATCC
TGCTTTTGAAATACATATTACAACTTTTTAAGTTTGGAAGGCTATATTTCAAGGACTGAAGTTACAGTATACTCAAGTGATACA 
CAAGCCTAGCACCCCACTTTCCACATAGTGTTCGATAAAGATTGATAAACTCGAAATCACAGACCTTTTAATTCTTAAGACAAA 
TAGCAGCAGAAAGAAACATCTTTGGCTTATTTCTGGTAAGGTTTTTATGCTCTGTAAAACAAAGAATTGTATTCATCCGCGCAG 
CACAGATTCTATTAAAAATAAATGTGAGAGTCGTTAATGTAGTACTGCTCATTTACCATCAAAATTCACT 

CCATCAGTTTAAATTGGATATTGGAATGAGCATTGATTACATTTAACTTGGTAGCCCAAMTTTCTTCATGGGGTTTTGAACTC
GGCGGGATTTCAAAGGTTTTAAAAATGAGTTTTT 

GGGGCGCAAGAGATTGGATTAAGACCATAGTAATACTTATTTTGTTCTTAACCATTTCAGGGCTTCTTGAAATAGAGGCTGTAT 
GGTGTAATGGAAAAAACAGCCTTGGAATCTGGGAGCCTGATTCCTGGATTCAGTCCCAGTTTTGCGTGACCTTGGGCAAGTTAC 
TTTACTTCTCTGAATTTCCGTTTCCTCCTCTGCAAAATGAGGATCGCAATAGCCACCTTGCAACCTTGACTGGAGCGAGCCTCG 

CACACCCCGCGCCGGCCTGGAGGAAGAGCAGCCATGATTACGCCGCCTTCGCTCCGCTACCCGCTTGCGGCTGGCGCCCTCCTC
CAGCAGGTCTAGGCGCTGCCGCGCTGCCCCACGCCTTTCCGCCGCTCGCGGGCCTGCGCCTCGGCGTCCCCGAGGAGGCCGCTG 
CGGGCTGAGGTAGCGCACCGGCCTCTCGGCGTCCCAGTCCGGTCCCGGGCGGAGGGAAAGCGGGCGACCCACCTCCGAGGCAGA 
AGCCGAGGCCCGGCCCCGCCGAGTGCGGAGGAGCGCAGGCAGCCCCCGCCCCTCGGCCCTCCCCCCGGCCCTCCCGGCCCTCCC 
TCCGCCCCCTCCGCCCTCGCGCGCCGCCCGCCCGGGTCGCCGCGGGGCCGTGGTGTACGTGCAGAGCGCGCAGAGCGAGTGGCG 

CCCGTATGCCCTGCGCTCCTCCACAGCCTGGGCCGGGCCGCCCGGGACGCTGAGGCGGCGGCGGCGGCCGAGGGGGCCGGTCTT
GCGCTCCCCAGGCCCGCGCGCCTGAGCCCAGGTTGCCATTCGCCGCACAGGCCCTATTCTCTCAGCCCTCGGCGGCGATGAGGC
GCTGAGGCGGCTGCCGGCGCTGCGCCGGAGCTTAGGACTCGGAAGCGGCCGGGCCGAGGGCGTGGGGTGCCGGCCTCCCTGAGG 
CGAGGGTAGCGGGTGCATGGCGCAGTAACGGCCCCTATCTCTCTCCCCGCTCCCCAGCCTCGGGCGAGGCCGTCCGGCCGCTAC 
CCCTCCTGCTCGGCCGCCGCAGTCGCCGTCGCCGCCGCCGCCGCCGCCATGGCCAATGACAGCGGCGGGCCCGGCGGGCCGAGC 
CCGAGCGAGCGAGACCGGCAGTACTGCGAGCTGTGCGGGAAGATGGAGAACCTGCTGCGCTGCAGCCGCTGCCGCAGCTCCTTC
TACTGCTGCAAGGAGCACCAGCGTCAGGACTGGAAGAAGCACAAGCTCGTGTGCCAGGGCAGCGAGGGCGCCCTCGGCCACGGA 
GTGGGCCCACACCAGCATTCCGGCCCCGCGCCGCCGGCTGCAGTGCCGCCGCCCAGGGCCGGGGCCCGGGAGCCCAGGAAGGCA 
GCGGCGCGCCGGGACAACGCCTCCGGGGACGCGGCCAAGGGAAAAGTAAAGGCCAAGCCCCCGGCCGACCCAGCGGCGGCCGCG 
TCGCCGTGTCGTGCGGCCGCCGGCGGCCAGGGCTCGGCGGTGGCTGCCGAAGCCGAGCCCGGCAAGGAGGAGCCGCCGGCCCGC 

TCATCGCTGTTCCAGGAGAAGGCGAACCTGTACCCCCCAAGCAACACGCCCGGGGATGCGCTGAGCCCCGGCGGCGGCCTGCGG
CCCAACGGGCAGACGAAGCCCCTGCCGGCGCTGAAGCTGGCGCTCGAGTACATCGTGCCGTGCATGAACAAGCACGGCATCTGT 
GTGGTGGACGACTTCCTCGGCAAGGAGACCGGACAGCAGATCGGCGACGAGGTGCGCGCCCTGCACGACACCGGGAAGTTCACG 
GACGGGCAGCTGGTCAGCCAGAAGAGTGACTCGTCCAAGGACATCCGAGGCGATAAGATCACCTGGATCGAGGGCAAGGAGCCC 
GGCTGCGAAACCATTGGGCTGCTCATGAGCAGCATGGACGACCTGATACGCCACTGTAACGGGAAGCTGGGCAGCTACAAAATC 

15 AATGGCCGGACGAAAGCCATGGTTGCTTGTTATCCGGGCAATGGAACGGGTTATGTACGTCATGTTGATAATCCAAATGGAGAT 
GGAAGATGTGTGACATGTATATATTATCTTAATAAAGACTGGGATGCCAAGGTAAGTGGAGGTATACTTCGAATTTTTCCAGAA 
GGCAAAGCCCAGTTTGCTGACATTGAACCCAAATTTGATAGACTGCTGTTTTTCTGGTCTGACCGTCGCAACCCTCATGAAGTA 
CAACCAGCATATGCTACMGGTACGCAATAACTGTTTGGTATTTTGATGCAGATGAGAGAGCACGAGCTAMGTAAAATATCTA 
ACAGGTGAAAAAGGTGTGAGGGTTGAACTCAATAAACCTTCAGATTCGGTCGGTAAAGACGTCTTCTAGAGCCTTTGATCCAGC 

20 AATACCCCACTTCACCTACAATATTGTTAACTATTTGTTAACTTGTGAATACGMTAAATGGGATAAAGAAAAATAGACMCCA 
GTTCGCATTTTMTMGGAAACAGAMCMCTTTTTC^ 

GATCAACTTCAAATCTGTGATTGCTTACAGGAGGAAGATAAGCTACTT^ATTGAAAATGGTTTTTACATCTGGATATGAAATAAG 
TGCCCTGTGTAGAATTTTTTTCATTCTTATATTTTGCCAGATCTGTTATCTAGCTGAGTTCATTTCATCTCTCCCTTTTTTATA 
TCAAGTTTGAATTTGGGATAATTTTTCTATATTAG 
25 AAAATAACATCAATCTATTTTTGTAAACCTGTTCATACTATTAAATTTTGCCCTAAAAGACCTCTTAATAATGATTGTTGCCAG 
TGACTGATGATTAATTTTATTTTACTTAAAATAAGAAAAGGAGCACTTTAATTACAACTGAAAAATCAGATTGTTTTGCAGTCC 
TTCCTTACACTAATTTGMCTCTTAMGATTGCTC^ 
CATTTACTACCAATAACTTTTAGTTAATGTTTTACAAGG 

91 

30 MLRTAMGLRSWLAAPWGALPPRPPLLLLLLL^^

GAREALFALSSNLSFLPGGEYQELLWGADAEKKQQCSFKGKDPQRDCQNYIKILLPLSGSHLFTCGTAAFSPMCTYINMENFTL
ARDEKGNVLLEDGKGRCPFDPNFKSTALVVDGELYTGTVSSFQGNDPAISRSQSLRPTKTESSLNWLQDPAFVASAYIPESLGS
LQGDDDKIYFFFSETGQEFEFFENTIVSRIARICKGDEGGERVLQQRWTSFLKAQLLCSRPDDGFPFNVLQDVFTLSPSPQDWR 
DTLFYGVFTSQWHRGTTEGSAVCVFTMKDVQRVFSGLYKEVNRETQQWYTVTHPVPTPRPGACITNSARERKINSSLQLPDRVL 

35 NFLKDHFLMDGQVRSRMLLLQPQARYQRVAVHRVPGLHHTYDVLFLGTGDGRLHKAVSVGPRVHIIEELQIFSSGQPVQNLLLD 
THRGLLYMSHSGWQVPMMCSLY^ 
VPTGEKPCEQVQFQPNTVNTLACPLLSNM 

EVVEDGVADQTDEGGSVPVIISTSRVSAPAGGKASWGADRSYWKEFLVMCTLFVLAVLLPVLFLLYRHRNSMKVFLKQGECASV
HPKTCPVVLPPETRPLNGLGPPSTPLDHRGYQSLSDSPPGARVFTESEKRPLSIQDSFVEVSPVCPRPRVRLGSEIRDSVV

40 92 

GCTCTGCCCAAGCCGAGGCTGCGGGGCCGGCGCCGGCGGGAGGACTGCGGTGCCCCGCGGAGGGGCTGAGTTTGCCAGGGCCCA 
CTTGACCCTGTTTCCCACCTCCCGCCCCCCAGGTCCGGAGGCGGGGGCCCCCGGGGCGACTCGGGGGCGGACCGCGGGGCGGAG 
CTGCCGCCCGTGAGTCCGGCCGAGCCACCTGAGCCCGAGCCGCGGGACACCGTCGCTCCTGCTCTCCGAATGCTGCGCACCGCG 
ATGGGCCTGAGGAGCTGGCTCGCCGCCCCATGGGGCGCGCTGCCGCCTCGGCCACCGCTGCTGCTGCTCCTGCTGCTGCTGCTd 

45 CTGCTGCAGCCGCCGCCTCCGACCTGGGCGCTCAGCCCCCGGATCAGCCTGCCTCTGGGCTCTGAAGAGCGGCCATTCCTCAGA 
TTCGAAGCTGAACACATCTCCAACTACACAGCCCTTCTGCTGAGCAGGGATGGCAGGACCCTGTACGTGGGTGCTCGAGAGGCC 
CTCTTTGCACTCAGTAGCAACCTCAGCTTCCTGCCAGGCGGGGAGTACCAGGAGCTGCTTTGGGGTGCAGACGCAGAGAAGAAA 
CAGCAGTGCAGCTTCAAGGGCAAGGACCCACAGCGCGACTGTCAAAACTACATCAAGATCCTCCTGCCGCTGAGCGGCAGTCAC 
CTGTTCACCTGTGGCACAGCAGCCTTCAGCCCCATGTGTACCTACATCAACATGGAGAACTTCACCCTGGCAAGGGACGAGAAG

50 GGGAATGTCCTCCTGGAAGATGGCAAGGGCCGTTGTCCCTTCGACCCGAATTTCAAGTCCACTGCCCTGGTGGTTGATGGCGAG 
CTCTACACTGGAACAGTCAGCAGCTTCCAAGGGAATGACCCGGCCATCTCGCGGAGCCAAAGCCTTCGCCCCACCAAGACCGAG
AGCTCCCTCAACTGGCTGCAAGACCCAGCTTTTGTGGCCTCAGCCTACATTCCTGAGAGCCTGGGCAGCTTGCAAGGCGATGAT 
GACAAGATCTACTTTTTCTTCAGCGAGACTGGCCAGGAATTTGAGTTCTTTGAGAACACCATTGTGTCCCGCATTGCCCGCATC 
TGCAAGGGCGATGAGGGTGGAGAGCGGGTGCTACAGCAGCGCTGGACCTCCTTCCTCAAGGCCCAGCTGCTGTGCTCACGGCCC 
5 GACGATGGCTTCCCCTTCAACGTGCTGCAGGATGTCTTCACGCTGAGCCCCAGCCCCCAGGACTGGCGTGACACCCTTTTCTAT 
GGGGTCTTCACTTCCCAGTGGCACAGGGGAACTACAGAAGGCTCTGCCGTCTGTGTCTTCACAATGAAGGATGTGCAGAGAGTC 
TTCAGCGGCCTCTACAAGGAGGTGAACCGTGAGACACAGCAGTGGTACACCGTGACCCACCCGGTGCCCACACCCCGGCCTGGA 
GCGTGCATCACCAACAGTGCCCGGGAAAGGAAGATCAACTCATCCCTGCAGCTCCCAGACCGCGTGCTGAACTTCCTCAAGGAC 
CACTTCCTGATGGACGGGCAGGTCCGAAGCCGCATGCTGCTGCTGCAGCCCCAGGCTCGCTACCAGCGCGTGGCTGTACACCGC 

10 GTCCCTGGCCTGCACCACACCTACGATGTCCTCTTCCTGGGCACTGGTGACGGCCGGCTCCACAAGGCAGTGAGCGTGGGCCCC 
CGGGTGCACATCATTGAGGAGCTGCAGATCTTCTCATGGGGACAGCCCGTGCAGAATCTGCTCCTGGACACCCACAGGGGGCTG 
CTGTATGCGGCCTCACACTCGGGCGTAGTCCAGGTGCCCATGGCCAACTGCAGCCTGTACAGGAGCTGTGGGGACTGCCTCCTC
GCCCGGGACCCCTACTGTGCTTGGAGCGGCTCCAGCTGCAAGCACGTCAGCCTCTACCAGCCTCAGCTGGCCACCAGGCCGTGG 
ATCCAGGACATCGAGGGAGCCAGCGCCAAGGACCTTTGCAGCGCGTCTTCGGTTGTGTCCCCGTCTTTTGTACCAACAGGGGAG 

15 AAGCCATGTGAGCAAGTCCAGTTCCAGCCCAACACAGTGAACACTTTGGCCTGCCCGCTCCTCTCCAACCTGGCGACCCGACTC 
TGGCTACGCAACGGGGCCCCCGTCAATGCCTCGGCCTCCTGCCACGTGCTACCCACTGGGGACCTGCTGCTGGTGGGCACCCAA 
CAGCTGGGGGAGTTCCAGTGCTGGTCACTAGAGGAGGGCTTCCAGCAGCTGGTAGCCAGCTACTGCCCAGAGGTGGTGGAGGAC
GGGGTGGCAGACCAAACAGATGAGGGTGGCAGTGTACCCGTCATTATCAGCACATCGCGTGTGAGTGCACCAGCTGGTGGCAAG 
GCCAGCTGGGGTGCAGACAGGTCCTACTGGAAGGAGTTCCTGGTGATGTGCACGCTCTTTGTGCTGGCCGTGCTGCTCCCAGTT 

20 TTATTCTTGCTCTACCGGCACCGGAACAGCATGAAAGTCTTCCTGAAGCAGGGGGAATGTGCCAGCGTGCACCCCAAGACCTGC 
CCTGTGGTGCTGCCCCCTGAGACCCGCCCACTCAACGGCCTAGGGCCCCCTAGCACCCCGCTCGATCACCGAGGGTACCAGTCC
CTGTCAGACAGCCCCCCGGGGTCCCGAGTCTTCACTGAGTCAGAGAAGAGGCCACTCAGCATCCAAGACAGCTTCGTGGAGGTA 
TCCCCAGTGTGCCCCCGGCCCCGGGTCCGCCTTGGCTCGGAGATCCGTGACTCTGTGGTGTGAGAGCTGACTTCCAGAGGACGC 
TGCCCTGGCTTCAGGGGCTGTGAATGCTCGGAGAGGGTCAACTGGACCTCCCCTCCGCTCTGCTCTTCGTGGAACACGACCGTG 

25 GTGCCCGGCCCTTGGGAGCCTTGGGGCCAGCTGGCCTGCTGCTCTCCAGTCAAGTAGCGAAGCTCCTACCACCCAGACACCCAA 
ACAGCCGTGGCCCCAGAGGTCCTGGCCAAATATGGGGGCCTGCCTAGGTTGGTGGAACAGTGCTCCTTATGTAAACTGAGCCCT 
TTGTTTAAAAAACAATTCCAAATGTGAAACTAGAATGAGAGGGAAGAGATAGCATGGCATGCAGCACACACGGCTGCTCCAGTT 
CATGGCCTCCCAGGGGTGCTGGGGATGCATCCAAAGTGGTTGTCTGAGACAGAGTTGGAAACCCTCACCAACTGGCCTCTTCAC 
CTTCCACATTATCCCGCTGCCACCGGCTGCCCTG^CTCACTGCAGATTCAGGACCAGCTTGGGCTGCGTGCGTTCTGCCTTGCC 

30 AGTCAGCCGAGGATGTAGTTGTTGCTGCCGTCGTCCCACCACCTCAGGGACCAGAGGGCTAGGTTGGCACTGCGGCCCTCACCA 
GGTCCTGGGCTCGGACCCAACTCCTGGACCTTTCCAGCCTGTATCAGGCTGTGGCCACACGAGAGGACAGCGGGAGCTCAGGAG 
AGATTTCGTGACAATGTACGCCTTTCCCTCAGAATTCAGGGAAGAGACTGTCGCCTGCCTTGCTCCGTTGTTGCGTGAGAACeC 
GTGTGCCCCTTCCCACCATATCCACCCTCGCTCCATCTTTGAACTCAAACACGAGGAACTAACTGCACCCTGGTCCTCTCCCCA 
GTCCCCAGTTCACCCTCCATCCCTCACCTTCCTCCACTCTAAGGGATATCAACACTGCCCAGCACAGGGGCCCTGAATTTATGT 

35 GGTTTTTATACATTTTTTAATAAGATGCACTTTATGTCATTTTTTAATAAAGTCTGAAGAATTACTGTTT 

92a 

GGCACGAGGCCGCTTGCGGGTGAGCTCTGCCCAAGCCGAGGCTGCGGGGCCCGTGCTCCGGCGAACGCCCACTCGAGACGGGTT
CGGCTCCGACGCCCCGCGGCGCCGGCGGGAGGACTGCGGTGCCCCGCGGAGGGGCTGAGTTTGCCAGCCGCGGCCGCCCTCCTG 
ACGCCACGGGGCGCCTCCCCGACTCAAACGGTGGGCCCACTTGACCCTGTTTCCCACCTCCCGCCCCCCAGGTCCGGAGGCGCC 

40 CGGGTGAACTGGGACAAAGGGTGGAGGGCGGGGGGTCCAGGCCTCCGCGGGGCCCCCGGGGCGACTCGGGGGCGGACCGCGGGG 
CGGAGCTGCCGCCCCCCCGGGGGCCCCGCTGAGCCCCCGCCTGGCGCCCCGCCTCGACGGCGGGGTGAGTCCGGCCGAGCCACC 
TGAGCCCGAGCCGCGGGACACCGTCGCTCCCACTCAGGCCGGCTCGGTGGACTCGGGCTCGGCGCCCTGTGGCAGCGAGGTGCT 
CTCCGAATGCTGCGCACCGCGATGGGCCTGAGGAGCTGGCTCGCCGACGAGAGGCTTACGACGCGTGGCGCTACCCGGACTCCT 
CGACCGAGCGGCCCCCATGGGGCGCGCTGCCGCCTCGGCCACCGCTGCTGCTGCTCCTGCTGGGGGTACCCCGCGCGACGGCGG

45 AGCCGGTGGCGACGACGACGAGGACGACCTGCTGCTCCTGCTGCAGCCGCCGCCTCCGACCTGGGCGCTCAGCCCCCGGACGAC 
GAGGACGACGTCGGCGGCGGAGGCTGGACCCGCGAGTCGGGGGCGATCAGGCTGCCTCTGGGCTCTGAAGAGCGGCCATTCCTC 
AGATTCGAAGCTAGTCGGACGGAGACCCGAGACTTCTCGCCGGTAAGGAGTCTAAGCTTCCTGAACACATCTCCAACTACACAG 
CCCTTCTGCTGAGCAGGGATGGCAGGGACTTGTGTAGAGGTTGATGTGTCGGGAAGACGACTCGTCCCTACCGTCCACCCTGTA 
CGTGGGTGCTCGAGAGGCCCTCTTTGCACTCAGTAGCAACCTTGGGACATGCACCCACGAGCTCTCCGGGAGAAACGTGAGTCA 

50 TCGTTGGACAGCTTCCTGCCAGGCGGGGAGTACCAGGAGCTGCTTTGGGGTGCAGACGGTCGAAGGACGGTCCGCCCCTCATGG
TCCTCGACGAAACCCCACGTCTGCCAGAGAAGAAACAGCAGTGCAGCTTCAAGGGCAAGGACCCACAGCGCGACGTCTCTTCTT 
TGTCGTCACGTCGAAGTTCCCGTTCCTGGGTGTCGCGCTGTGTCAAAACTACATCAAGATCCTCCTGCCGCTCAGCGGCAGTCA
CCTGTTACAGTTTTGATGTAGTTCTAGGAGGACGGCGAGTCGCCGTCAGTGGACAACACCTGTGGCACAGCAGCCTTCAGCCCC 
ATGTGTACCTACATCAACATGGGTGGACACCGTGTCGTCGGAAGTCGGGGTACACATGGATGTAGTTGTACCAGAACTTCACCC
TGGCAAGGGACGAGAAGGGGAATGTCCTCCTGGAAGATTCTTGAAGTGGGACCGTTCCCTGCTCTTCCCCTTACAGGAGGACCT 
5 TCTAGGCAAGGGCCGTTGTCCCTTCGACCCGAATTTCAAGTCCACTGCCCTGGTCCGTTCCCGGCAACAGGGAAGCTGGGCTTA 
AAGTTCAGGTGACGGGACCAGGTTGATGGCGAGCTCTACACTGGAACAGTCAGCAGCTTCCAAGGGAATGCCAACTACCGCTCG 
AGATGTGACCTTGTCAGTCGTCGAAGGTTCCCTTACACCCGGCCATCTCGCGGAGCCAAAGCCTTCGCCCCACCAAGACCGAGA 
GCTGGGCCGGTAGAGCGCCTCGGTTTCGGAAGCGGGGTGGTTCTGGCTGTCGTCCCTCAACTGGCTGCAAGACCCAGCTTTTGT 
GGCCTCAGCCTACATTCCAGGGAGTTGACCGACGTTCTGGGTCGAAAACACCGGAGTCGGATGTAAGGTGAGAGCCTGGGCAGC 

10 TTGCAAGGCGATGATGACAAGATCTACTTTTTCTACTCTCGGACCCGTCGAACGTTCCGCTACTACTGTTCTAGATGAAAAAGA 
TCAGCGAGACTGGCCAGGAATTTGAGTTCTTTGAGAACACCATTGTGTCCAGTCGCTCTGACCGGTCCTTAAACTCAAGAAACT 
CTTGTGGTAACACAGGCGCATTGCCCGCATCTGCAAGGGCGATGAGGGTGGAGAGCGGGTGCTACAGCGTAACGGGCGTAGACG 
TTCCCGCTACTCCCACCTCTCGCCCACGATGTGCAGCGCTGGACCTCCTTCCTCAAGGCCCAGCTGCTGTGCTCACGGCCCGCG 
TCGCGACCTGGAGGAAGGAGTTCCGGGTGGACGACACGAGTGCCGGGCACGATGGCTTCCCCTTCAACGTGCTGCAGGATGTCT 

15 TCACGCTGAGCCCCTGCTACCGAAGGGGAAGTTGCACGACGTCCTACAGAAGTGCGACTCGGGGAGCCCCCAGGACTGGCGTGA 
CACCCTTTTCTATGGGGTCTTCACTTCCCATCGGGGGTCCTGACCGCACTGTGGGAAAAGATACCCCAGAAGTGAAGGGTGTGG
CACAGGGGAACTACAGAAGGCTCTGCCGTCTGTGTCTTCACAATGACACCGTGTCCCCTTGATGTCTTCCGAGACGGCAGACAC 
AGAAGTGTTACTAGGATGTGCAGAGAGTCTTCAGCGGCCTCTACAAGGAGGTGAACCGTGAGTCCTACACGTCTCTCAGAAGTC 
GCCGGAGATGTTCCTCCACTTGGCACTCACACAGCAGTGGTACACCGTGACCCACCCGGTGCCCACACCCCGGCCTGGTGTGTC 

20 GTCACCATGTGGCACTGGGTGGGCCACGGGTGTGGGGCCGGACCAGCGTGCATCACCAACAGTGCCCGGGAAAGGAAGATCAAC 
TCATCCCTGCTCGCACGTAGTGGTTGTCACGGGCCCTTTCCTTCTAGTTGAGTAGGGACGAGCTCCCAGACCGCGTGCTGAACT 
TCCTCAAGGACCACTTCCTGATGGACTCGAGGGTCTGGCGCACGACTTGAAGGAGTTCCTGGTGAAGGACTACCTGGGGCAGGT 
CCGAAGCCGCATGCTGCTGCTGCAGCCCCAGGCTCGCTACCACCCGTCCAGGCTTCGGCGTACGACGACGACGTCGGGGTCCGA 
GCGATGGTGCGCGTGGCTGTACACCGCGTCCCTGGCCTGCACCACACCTACGATGTCCCGCGCACCGACATGTGGCGCAGGGAC 

25 CGGACGTGGTGTGGATGCTACAGGTCTTCCTGGGCACTGGTGACGGCCGGCTCCACAAGGCAGTGAGCGTGGGCAGAAGGACCC 
GTGACCACTGCCGGCCGAGGTGTTCCGTCACTCGCACCCGCCCCGGGTGCACATCATTGAGGAGCTGCAGATCTTCTCATCGGG 
ACAGCCGGGGCCCACGTGTAGTAACTCCTCGACGTCTAGAAGAGTAGCCCTGTCGGCGTGCAGAATCTGCTCCTGGACACCCAC 
AGGGGGCTGCTGTATGCGGCCTGCACGTCTTAGACGAGGACCTGTGGGTGTCCGCCGACGACATACGCCGGACACACTCGGGCG 
TAGTCCAGGTGCCCATGGCCAACTGCAGCCTGTACCGGGTGTGAGCCCGCATCAGGTCCACGGGTACCGGTTGACGTCGGACAT 

30 GGCCAGCTGTGGGGACTGCCTCCTCGCCCGGGACCCCTACTGTGCTTGGAGCGGTCGACACCCCTGACGGAGGAGCGGGCCCTG 
GGGATGACACGAACCTCGCCCTCCAGCTGTAAGCACGTCAGCCTCTACCAGCCTCAGCTGGCCACCAGGCGAGGTCGACATTCG 
TGCAGTCGGAGATGGTCGGAGTCGACCGGTGGTCCGCGTGGATCCAGGACATCGAGGGAGCCAGCGCCAAGGACCTTTGCAGCG 
CGGCACCTAGGTCCTGTAGCTCCCTCGGTCGCGGTTCCTGGAAACGTCGCGCTCTTCGGTTGTGTCCCCGTCTTTTGTACCAAC 
AGGGGAGAAGCCATGTGAAGAAGCCAACACAGGGGCAGAAAACATGGTTGTCCCCTCTTCGGTACACTGCAAGTCCAGTTCCAG 

35 CCCAACACAGTGAACACTTTGGCCTGCCCGCTCCCGTTCAGGTCAAGGTCGGGTTGTGTCACTTGTGAAACCGGACGGGCGAGG 
TCTCCAACCTGGCGACCCGACTCTGGCTACGCAACGGGGCCCCCGTCAATAGAGGTTGGACCGCTGGGCTGAGACCGATGCGTT 
GCCCCGGGGGCAGTTAGCCTCGGCCTCCTGCCACGTGCTACCCACTGGGGACCTGCTGCTGGTGGGCGGAGCCGGAGGACGGTG 
CACGATGGGTGACCCCTGGACGACGACCACCCCACCCAACAGCTGGGGGAGTTCCAGTGCTGGTCACTAGAGGAGGGCTTCCGT 
GGGTTGTCGACCCCCTCAAGGTCACGACCAGTGATCTCCTCCCGAAGGAGCAGCTGGTAGCCAGCTACTGCCCAGAGGTGGTGG 

40 AGGACGGGGTGGCATCGTCGACCATCGGTCGATGACGGGTCTCCACCACCTCCTGCCCCACCGTGACCAAACAGATGAGGGTGG 
CAGTGTACCCGTCATTATCAGCACATCGCGCTGGTTTGTCTACTCCCACCGTCACATGGGCAGTAATAGTCGTGTAGCGCTGTG 
AGTGCACCAGCTGGTGGCAAGGCCAGCTGGGGTGCAGACAGGTCCTACACTCACGTGGTCGACCACCGTTCCGGTCGACCCCAC 
GTCTGTCCAGGAACTGGAAGGAGTTCCTGGTGATGTGCACGCTCTTTGTGCTGGCCGTGCTGTGACCTTCCTCAAGGACCACTA 
CACGTGCGAGAAACACGACCGGCACGACCTCCCAGTTTTATTCTTGCTCTACCGGCACCGGAACAGCATGAAAGTCTTGAGGGT 

45 CAAAATAAGAACGAGATGGCCGTGGCCTTGTCGTACTTTCAGAACCTGAAGCAGGGGGAATGTGCCAGCGTGCACCCCAAGACC 
TGCCCTGTGGGGACTTCGTCCCCCTTACACGGTCGCACGTGGGGTTCTGGACGGGACACCTGCTGCCCCCTGAGACCCGCCCAC 
TCAACGGCCTAGGGCCCCCTAGCACCACGACGGGGGACTCTGGGCGGGTGAGTTGCCGGATCCCGGGGGATCGTGGCCACTCGA 
TCACCGAGGGTACCAGTCCCTGTCAGACAGCCCCCCGGGGGCGGTGAGCTAGTGGCTCCCATGGTCAGGGACAGTCTGTCGGGG 
GGCCCCCGCCGAGTCTTCACTGAGTCAGAGAAGAGGCCACTCAGCATCCAAGACAGCTGGCTCAGAAGTGACTCAGTCTCTTCT 

50 CCGGTGAGTCGTAGGTTCTGTCGATCGTGGAGGTATCCCCAGTGTGCCCCCGGCCCCGGGTCCGCCTTGGCTCGAGCACCTCCA
TAGGGGTCACACGGGGGCCGGGGCCCAGGCGGAACCGAGCGAGATCCGTGACTCTGTGGTGTGAGAGCTGACTTCCAGAGGACG 
CTGCCCCTCTAGGCACTGAGACACCACACTCTCGACTGAAGGTCTCCTGCGACGGGTGGCTTCAGGGGCTGTGAATGCTCGGAG 
AGGGTCAACTGGACCTCCCCTCACCGAAGTCCCCGACACTTACGAGCCTCTCCCAGTTGACCTGGAGGGGAGCGCTCTGCTCTT
CGTGGAACACGACCGTGGTGCCCGGCCCTTGGGAGCCTGCGAGACGAGAAGCACCTTGTGCTGGCACCACGGGGCGGGAACCCT 
CGGATGGAGCCAGCTGGCCTGCTGCTCTCCAGTCAAGTAGCGAAGCTCCTAGCAAGCTCGGTCGACCGGACGACGAGAGGTCAG 
TTCATCGCTTCGAGGATGGTCCCAGACACCCAAACAGCCGTGGCCCCAGAGGTCCTGGCCAAATATGGGGGGGTCTGTGGGTTT 
5 GTCGGCACCGGGGTCTCCAGGACCGGTTTATACCCCGCCTGCCTAGGTTGGTGGAACAGTGCTCCTTATGTAAACTGAGCCCTT 
TGCGGACGGATCCMCCACCTTGTCACGAGGAATACATTTGACTCGGGAAACTTTAGAAAACAATTCCAAATGTGAAACTAGAA 
TGAGAGGGAAGAGATAGCAAATCTTTTGTTAAGGTTTACACTTTGATCTTACTCTCCCTTCTCTATCGATGGCATGCAGCACAC
ACGGCTGCTCCAGTTCATGGCCTCCCAGGGGTGCTACCGTACGTCGTGTGTGCCGACGAGGTCAAGTACCGGAGGGTCCCCACG 
TGGGGATGCATCCAAAGTGGTTGTCTGAGACAGAGTTGGAAACCCTCACCACCCCTACGTAGGTTTCACCAACAGACTCTGTCT 

10 CAACCTTTGGGAGTGGAACTGGCCTCTTCACCTTCCACATTATCCCGCTGCCACCGGCTGCCCTGTTTGACCGGAGAAGTGGAA 
GGTGTAATAGGGCGACGGTGGCCGACGGGACACTCACTGCAGATTCAGGACCAGCTTGGGCTGCGTGCGTTCTGCCTTGCCAGA 
GTGACGTCTAAGTCCTGGTCGAACCCGACGCACGCAAGACGGAACGGTGTCAGCCGAGGATGTAGTTGTTGCTGCCGTCGTCCC 
ACCACCTCAGGGACCAGTCGGCTCCTACATCAACAACGACGGCAGCAGGGTGGTGGAGTCCCTGCAGAGGGCTAGGTTGGCACT 
GCGGCCCTCACCAGGTCCTGGGCTCGGACCGTCTCCCGATCCAACCGTGACGCCGGGAGTGGTCCAGGACCCGAGCCTGGCAAC 

15 TCCTGGACCTTTCCAGCCTGTATCAGGCTGTGGCCACACGAGAGGAGTTGAGGACCTGGAAAGGTCGGACATAGTCCGACACCG 
GTGTGCTCTCCTCAGCGCGAGCTCAGGAGAGATTTCGTGACAATGTACGCCTTTCCCTCAGAGTCGCGCTCGAGTCCTCTCTAA 
AGCACTGTTACATGCGGAAAGGGAGTCTATTCAGGGAAGAGACTGTCGCCTGCCTTCCTCCGTTGTTGCGTGAGAACCTAAGTC 
CCTTCTCTGACAGCGGACGGAAGGAGGCAACAACGCACTCTTGGCGTGTGCCCCTTCCCACCATATCCACCCTCGCTCCATCTT 
TGAACTCAAAGCACACGGGGAAGGGTGGTA^AGGTGGGAGCGAGGTAGAAACTTGAGTTTCACGAGGAACTAACTGCACCCTGG 

20 TCCTCTCCCCAGTCCCCAGTTCACCCGTGCTCCTTGATTGACGTGGGACCAGGAGAGGGGTCAGGGGTCAAGTGGGTCCATCCC 
TCACCTTCCTCCACTCTAAGGGATATCAACACTGCCCAGCACAGGTAGGGAGTGGAAGGAGGTGAGATTCCCTATAGTTGTGAC 
GGGTCGTGAGGGGCCCTGAATTTATGTGGTTTTTATATATTTTTTAATAAGATGCACTTCCCCGGGACTTAAATACACCAAAAA 
TATATAAAAAATTATTCTACGTGATTATGTC 

AAAATTATTTCAGACTTCTTAATGAGAAATTTTTTTTTTTAAAAAAACTTTTTTTG 
25 93 

MVGYDPKPDGRNNTKFQVAVAGSVSGLVTRALISPFDVIKIRFQLQHERLSRSDPSAKYHGILQASRQILQEEGPTAFWKGHVP 
AQILSIGYGAVQFLSFEMLTELVHRGSVYDAREFSVHFVCGGLAACMATLTVHPVDVLRTRFAAQGEPKVYNTLRHAVGTMYRS 
EGPQVFYKGLAPTLIAIFPYAGLQFSCYSSLKHLYKWAIPAEGKKNENLQNLLCGSGAGVISKTLTYPLDLFKKRLQVGGFEHA 
RAAFGQWRYKGLMDCAKQVLQKEGALGFFKGLSPSLLKMLSTGFMFFSYEFFCNVFHCMNRTASQR 

30 94 

ATGGTTGGCTATGACCCCAAACCAGATGGCAGGAATAACACCAAGTTCCAGGTGGCAGTGGCTGGGTCTGTGTCTGGACTTGTT 
ACTCGGGCGCTGATCAGTCCCTTCGACGTCATCAAGATCCGTTTCCAGCTTCAGCATGAGCGCCTGTCTCGCAGTGACCCCAGC
GCAAAGTACCATGGCATCCTCCAGGCCTCTAGGCAGATTCTGCAGGAGGAGGGTCCGACAGCTTTCTGGAAAGGACACGTCCCA 
GCTCAGATTCTCTCCATAGGCTATGGAGCTGTCCAATTCTTGTCATTTGAAATGCTGACGGAGCTGGTCCACAGAGGCAGCGTG 

35 TACGACGCCCGGGAATTCTCAGTGCACTTTGTATGTGGTGGCCTGGCTGCCTGTATGGCCACCCTCACTGTGCACCCCGTGGAT 
GTTCTGCGCACCCGCTTTGCAGCTCAGGGTGAGCCCAAGGTCTATAATACGCTGCGCCACGCCGTGGGGACCATGTATAGGAGC 
GAAGGCCCCCAGGTTTTCTACAAAGGCTTGGCTCCCACCTTGATCGCCATCTTCCCCTACGCCGGGCTGCAGTTCTCTTGCTAC 
AGCTCCTTGAAGCACCTGTACAAGTGGGCCATACCAGCCGAAGGAMGAAAAATGAGAACCTCCAAAACCTGCTTTGTGGCAGT 
GGAGCTGGTGTCATCAGCAAGACCCTGACATATCCGCTGGACCTCTTCAAGAAGCGGCTACAGGTTGGAGGGTTTGAGCATGCC 

40 AGAGCTGCCTTTGGCCAGGTACGGAGATACAAGGGCCTCATGGACTGTGCCAAGCAGGTGCTACAAAAGGAAGGCGCCCTGGGC 
TTCTTCAAGGGCCTGTCCCCCAGCTTGCTGAAGGCTGCCCTCTCCACAGGCTTCATGTTCTTCTCGTATGAATTCTTCTGTAAT 
GTCTTCCACTGCATGAACAGGACAGCCAGCCAGCGCTGA

95 

MELALRRSPVPRWLLLLPLLLGLNAGAVIDWPTEEGKEW 
45 FGNFEEIGPLDSDLKPRKTTWLQAASLLFVDNPW^ 

GGKMAAGIGLELYKAIQRGTIKCNFAGVALGDSWISPVDSVLSWGPYLYSMSLLEDKGLAEVSKVAEQVLNAWKGLYR 
WGKAEMIIEQNTDGWFYNILTKSTPTSTMESSLEFTQSHLVCLCQRHVRHLQRDALSQLMGPIRKKLKIIPEDQSWGGQATN 
VFVNMEEDFMKPVISIVDELLEAGINVTVYNGQLDLIVDTMGQEAWVRKLKWPEL
LAFYWILKAGHMVPSDQGDMALKMMRLVTQQE
96

CCTGTTGCTGATGCTGCCGTGCGGTACTTGTCATGGAGCTGGCACTGCGGCGCTCTCCCGTCCCGCGGTGGTTGCTGCTGCTGC 
CGCTGCTGCTGGGCCTGAACGCAGGAGCTGTCATTGACTGGCCCACAGAGGAGGGCAAGGAAGTATGGGATTATGTGACGGTCC
GCAAGGATGCCTACATGTTCTGGTGGCTCTATTATGCCACCAACTCCTGCAAGAACTTCTCAGAACTGCCCCTGGTCATGTGGC 
5 TTCAGGGCGGTCCAGGCGGTTCTAGCACTGGATTTGGAAACTTTGAGGAAATTGGGCCCCTTGACAGTGATCTCAAACCACGGA 
AAACCACCTGGCTCCAGGCTGCCAGTCTCCTATTTGTGGATAATCCCGTGGGCACTGGGTTCAGTTATGTGAATGGTAGTGGTG 
CCTATGCCAAGGACCTGGCTATGGTGGCTTCAGACATGATGGTTCTCCTGAAGACCTTCTTCAGTTGCCACAAAGAATTCCAGA 
CAGTTCCATTCTACATTTTCTCAGAGTCCTATGGAGGAAAAATGGCAGCTGGCATTGGTCTAGAGCTTTATAAGGCCATTCAGC 
GAGGGACCATCAAGTGCAACTTTGCGGGGGTTGCCTTGGGTGATTCCTGGATCTCCCCCGTGGATTCGGTGCTCTCCTGGGGAC 

10 CTTACCTGTACAGCATGTCTCTTCTCGAAGACAAAGGTCTGGCAGAGGTGTCTAAGGTTGCAGAGCAAGTACTGAATGCCGTAA 
ATAAGGGGCTCTACAGAGAGGCCACAGAGCTGTGGGGGAAAGCAGAAATGATCATTGAACAGAACACAGATGGGGTGAACTTCT 
ATAACATCTTAACTAAAAGCACTCCCACGTCTACAATGGAGTCGAGTCTAGAATTCACACAGAGCCACCTAGTTTGTCTTTGTC 
AGCGCCACGTGAGACACCTACAACGAGATGCCTTAAGCCAGCTCATGAATGGCCCCATCAGAAAGAAGCTCAAAATTATTCCTG 
AGGATCAATCCTGGGGAGGCCAGGCTACCAACGTCTTTGTGAACATGGAGGAGGACTTCATGAAGCCAGTCATTAGCATTGTGG 

15 ACGAGTTGCTGGAGGCAGGGATCAACGTGACGGTGTATAATGGACAGCTGGATCTCATCGTAGATACCATGGGTCAGGAGGCCT 
GGGTGCGGAAACTGAAGTGGCCAGAACTGCCTAAATTCAGTCAGCTGAAGTGGAAGGCCCTGTACAGTGACCCTAAATCTTTGG 
AAACATCTGCTTTTGTCAAGTCCTACAAGAACCTTGCTTTCTACTGGATTCTGAAAGCTGGTCATATGGTTCCTTCTGACCAAG 
GGGACATGGCTCTGAAGATGATGAGACTGGTGACTCAGCAAGAATAGGATGGATGGGGCTGGAGATGAGCTGGTTTGGCCTTGG 
GGCACAGAGCTGAGCTGAGGCCGCTGAAGCTGTAGGAAGCGCCATTCTTCCCTGTATCTAACTGGGGCTGTGATCAAGAAGGTT 

20 CTGACCAGCTTCTGCAGAGGATAAAATCATTGTCTCTGGAGGCAATTTGGAAATTATTTCTGCTTCTTAAAAAAACCTAAGATT 
TTTTAAAAAATTGATTTGTTTTGATCAAAATAAAGGATGATAATAGATATTATTTTTTCTTATGACAGAAGCAAATGATGTGAT 
TTATAGAAAAACTGGGAAATACAGGTACCCAAAGAGTAAATCAACATCTGTATACCCCCTTCCCAGGGGTAAGCACTGTTACCA 
ATTTAGCATATGTCCTTGCAGAATTTTTTTTTCTATATATACATATATATTTTTTACCAAAATGAATCATTACTCTATGTTGTT 
TTACTATTTGTTTGACATATCAGTATATCTGAAACACCTTTTCATGTCAATAAATGTTCTTCTCTAACATTAA 

25 98 

AATAGTTCATTAATTAGTCTTTATTATGTGAAAAGACTTTTCTAAAATTCTTCCATMTAATTGAAGATATAAATTACAA 

TAAAATTTACAAAAGMTAAAAGCATTAAMTAAAGTGTGAAAAATTTAAAAAGGGTAAAGATGAAAATTA^ 

CTCAAAAATCCGAAAGTGATTTTATCAATGTCCTAAGAAAATT 

TGCCGGGCTGGGGGCGGGAGCTACGGCTTCTCTGGGGACGCGGAAGCGAGAAGCAGGGACCTTGGGGCGCGCCCGGTTTTCCGG 

30 GACGGTGCCCCGGCCCCCAGCCCACGCCTCCGGCCGCTGGCATGGTGCTGCTTGCTGGGACCCGGCCGCAGGGTGGCTGGGAGC 
GCTGCATGACCCCGCCACCGCCGTCCCCACTCCTAGGCGCACAGGTCGAGGAGGACCGCGCTGACTACAAAGAGTTCCAGGGCC 
CCACACCTGCAGCATAGCCTGGGACGACTCTGTACCCTTTCCAGGACGAGGAGGAGCACGGCGTCGAGGGCTTGGAGAGCGTCC 
CGGAGGAGGGCGTCCAGGAGGCATGGGGCTGCTGCGGACGCTGGTGCGGCGAGGGGTGAGCGTTGAGAAGGCGCAGGAGACTGA 
CCACAATGGCTGGACCAGCCTTAATGTCGCCTGCTACCACGGCTTTGTGGATATACCGTGGTGGCCTTAGCTGAGTAACCCCAC 

35 ACTGACGTCAACTGGCAGGACAGGGAGGGGAACGCAGCCCTAAATACAGCTGCACAGGCAGGAGCTGCCTCTGGCCCCATGGGC 
ACCGTTGCCTTGGACACTGGCTCTCAGCCTTGTGCTTCCTGGTCAGCATCACGTGGGCTGGTTCATTTGAGCTTTAACACAGAT 
CTGGTGGAAACACAGGCTGCCAGGCCCTGCCCCTGGAGTCTTTCTTTGAATAGGCCTGGGGTGGGGCCCAAGAATGAGCAGAAG 
AACAGGTTTCCTGGTGAGGCTGATGCCGCTGGCCCAGGGTTCTCACGTTGAGGACCGAGAGCTTTGAGGTTTTCATACCTGATG 
ATGTCCAGGAACCCTTCTCAGTAGGCACTGAAGACCATCAGCAGAATCACAGACCCCAGGAGAGATGTCGTCAGACAGACACAG 

40 AGGCATCACCGAATTAAAGTGAAAATGAAGAAAGGAGCTGAGCATCTGTTTCATGACTTTCGTGGCTGTTTTACTAAAGAGGCT 
ATCCTGGCCCAGTCAGGGCACGCTATCATCACCAACTACTTGTTGAACTACGTTCAGGGTCTTGACCTTGAAGGAGGGACGCGT 
TCGGGTTGAAAGCCGCCATGCAGGGTCGAACGGATCGCATCGGAGCCCTGAGCTAGCAGGGGCGGGTGTCCACGCGAGGGCCCC 
CGCCGTGGGATGTCGCCAGAGGAGTGGCCACTTATACGTCCGCCTCATGCAGAGGCTCCTGGAGCGCCCCTGCCGGGAGCTGTT 
GGGGAAAAAGTACCAGCTTGAGCTGCCGCCGCTCCCCGAGAGGCCGCAAAAGCCCGAGGGCTCCAAGAACTGCCTGCAGAGGCT 

45 CAGGGACTGCGAGCTGTCCGCGCTGACGCCGCGCTCCATGCGGGGCCCAGAGGACCGGGGCGCCCTGGACCACATGGTCAGGAT 
GACCACGAGCGTCTAACAGCCCCGCCGGGGCCCTCGCGTGCCAGACTATGTGCCCCGAGAGCACCTTGTGCTGG 

99 

MAPMGIRLSPLGVAVFCLLGLGVLYHL^ 

RESWLHEKSKGKTREGAEDKMTSGDVLSNRKMFYLLKTAFPSVQINTEEHVDAADQEVILWDHKIPEDILKEVTTPKEVPAES 
50 VTWIDPLDATQEYTEDLRKWTTMVCV^^ 
ALQTFGNQTTIIPAGGAGYKVLALLDVPDKSQEKADLYIHVTYIKKWDICAGNAILKALGGHMTTLSGEEISYTGSDGIEGGLL
ASIRMNHQALVRKLPDLEKTGHK 

100 

CACCTCGCGGCCGTAGGCTAACGTGGAAGTCGGACCAGCCGGCCGGCGGAAGAACCTAGAGCGCGCTGCCTGGCGAGTCAGGCG 
5 CGCGGGGCGGCGTTGGTGGTCTTCGCGGCGTAACTCGGCCTTTCCTGGGAGGGAGTGATGGGGCGCACCGGGGCCGGGGAGCGG 
GCGCCAGTGTAGCCCGCGCGGCGCCTGGCCCGGAGCGCGGCGGCTGCGGCGGCGGCGGCGGCGGGCGCTGGAGGCCTGTGAGAG 
CCGCAGCCCGGAGCGCCCGGCTTCCCACGCCATGGCCCCCATGGGCATCCGCCTTTCCCCACTGGGGGTGGCAGTGTTTTGTCT 
GCTGGGGCTCGGCGTGCTCTACCACCTCTACTCGGGCTTCTTGGCCGGCCGCTTCAGCCTCTTCGGCCTGGGCGGCGAGCCTGG 
CGGCGGCGCGGGGGGGCCCGCGGCCGCGGCCGATGGGGGCACCGTGGACTTGCGCGAGATGCTGGCTGTGTCAGTGCTGGCCGC 

10 AGTCCGCGGCGGCGACGAGGTGAGGCGCGTCCGCGAGAGCAACGTCCTCCACGAGAAGTCCAAGGGGAAGACGCGCGAGGGAGC 
CGAGGACAAGATGACCAGCGGCGACGTGCTGTCCAACCGCAAGATGTTCTACCTGCTCAAGACCGCCTTCCCCAGCGTCCAGAT 
TAATACTGAGGAACACGTGGATGCAGCTGATCAGGAGGTTATCTTGTGGGATCATAAGATTCCTGAGGATATCCTAAAGGAAGT 
AACTACTCCTAAAGAGGTACCAGCAGAAAGTGTTACTGTCTGGATTGACCCACTTGATGCTACACAGGAATATACAGAGGATCT
TCGAAAGTACGTCACTACTATGGTGTGTGTGGCTGTAAATGGTAAACCCATGCTAGGAGTTATACATAAGCCATTTTCCGAATA 

15 TACAGCTTGGGCAATGGTAGATGGTGGTTCAAATGTGAAAGCCCGCTCTTCCTACAATGAGAAGAGCCCAAGGATGGTTGTGTC 
TCGTTCCCATTCAGGGATGGTCAAACAGGTCGCTCTTCAGACTTTTGGAAACCAGACTACAATTATCCCAGCTGGTGGTGCTGG 
TTATAAAGTTTTAGCACTTTTGGATGTGCCTGATAAGAGTCAAGAAAAAGCTGATTTATACATCCATGTGACATACATCAAAAA 
GTGGGATATATGTGCTGGTAATGCCATCTTAAAAGCCCTAGGGGGGCATATGACTACCCTGAGTGGTGAAGAAATCAGTTACAC 
TGGTTCAGACGGCATTGAAGGGGGACTCCTTGCTAGCATCAGAATGAACCACCAGGCCCTGGTCAGAAAACTCCCAGATCTAGA 

20 AAAGACAGGACATAAATGAGCATAACTGATTACAGGGTACAGTTCTTCACAGCTGAAATGGTTAGCCTGAGATGCTGGAAGCTT 
CAAAGGATTGGTGGAGACTATGCATGGTTAAGGCCATCCCGAACTTTTTAAAGTATTTATGAAGCATCAGAGACTTATTTTCCC 
TGTAATAGAATGCAAAATCAGGGAAAATGGGTTGCTTTGTGTCTCAAGTATTGTCTTTATTTTTGAGACTATTTTCATACAGTT 
GTCATACACAAGGCGCATATATATATTTGTGAATTAAAATCTGTAGCTGAGTCTAAAAAAAAAAAAAAAAAA 

102 

25 TTTTTTTTTATTTGGATTCTATTTATTTAAAATTAAATTCTTTGCAAAATTGAACTTCTCAACTAAAACGTGTCCATGTCAGAA 
TTTTAACTGTTAGCAGGTAGTTTGTGGCAAAGATC 

TTTTTTCTGTTGAGACTATCATTATTTGTCTTATTACCCAAGAGGCAATTACCTGAATTTGGATGTCTGAATTATAACTTATGC 
AGGAATAGTTCTGTAAATACATTTAAATAAACTGTAAAGATATTTAATAAATATAGTATTTATACTAATCTGTGTGC 
GGTTTGAATAGTAACTAAATGAGACACCAGCCCTTGACATTGAGTTTGTTGGTCACTATCAGGTCCTCATTTCCAAGCCTCCTA 
30 GTCATTCTAGCACTGATTATATGCTGCTACTTTAACTGGCTCCAGCTGCTTCACTACATCAGTTTAGCTTCCTCAGAAATTCAT 
CAAAATGGACGGACAATTAAATGTAAATTATAGAACTTTTTCCCAGCTGAGGCTTTGCACCTTCCGTATAGTATAGAGGGAAGC 
TACAAAAA

103 

MPIVDFHLQKLPAAAAAPFLDKSHSSSPGSPKTAELFFTERTPWFHQFPQIPVAPDLSSRGTRRKDFLSTRRILGAAKVPWSF 
35 YHSRGVLLKASERASGGSKAVQPGRKEKETPRHRKPRFSRRKNR 

104 

AATATTGCAGAGCAACTTGAAAGGGGGAGCTGGCCGCGTAGGAGGTGAGAGCCAGCCGCAAAAATCCGTTCGGCGGCATTTTCT 
CTTCAAACCCGCCCGGTCGGATGTGGCATTTGAAGAAAGAAGGCGTGGGTCAATAATCAACCCGCACTTTCTCCTCCAAACTGC 
TGACGCGACTCTCACCGCCTTCTCAGCTAAGCCAGCTGCCGGGAAGGCCAGCTCTCCGTAAGAACCTGCCAGGGGAAGGGATTC 

40 TCTGGGCCAAGGGGACCGAGGGCTCCCGAGCAAGGGCTGGGGCGCCCTGAAAGCATACCTTTCCACTGGCGCCAAGACCCTGCC 
GGGCAAAACGCGGGGACCAGACCTGAGGAGCCGTGGGAGCCCAGGGAGATCTGGGGCGGGAGCTTCGTTCCGGCCAGGGCCTCT 
ATCCGCCCCGATACGTGGGTCCTGGGCCGCGGGCCTCAGCGGGCAGAAGTGGGGCTCCGACCTCAAGATGAAGGTCCAGCGTCC 
TGGCATGCAGATGGTCGGTGCCCCAGCCTCAAGCCGCTCTGCTTAAATCCAAAGGACTGGCTTTGGAGCAAGCGTAGGCGAGAT 
TATTTTAAAATAATAATTTATGCTGGCGGAAGGTTTTTATTTGTTGGTTTGCTATTGGTGTTTTGTTTTTTAAGCGAAAATGCC 

45 GATTGTTGATTTCCACCTCCAGAAATTACCAGCCGCCGCCGCTGCGCCCTTTCTAGACAAATCCCACAGCAGTTCACCCGGCTC 
CCCCAAAACCGCAGAGCTTTTCTTTACTGAGAGGACCCCTTGGTTCCACCAATTCCCCCAAATCCCCGTGGCTCCCGATCTCTC 
ATCCAGGGGGACTAGAAGAAAAGATTTTCTCAGCACTAGAAGAATACTTGGAGCAGCAAAGGTGCCCGTGGTGTCTTTCTACCA 
TTCAAGGGGTGTCCTCCTGAAGGCGTCTGAGAGGGCCTCTGGCGGTTCCAAGGCTGTGCAGCCGGGGAGGAAGGAAAAGGAAAC 
GCCAAGACATAGAAAACCACGCTTTTCCCGTAGGAAGAACCGATGATGAGCCCTGATGAAAGAAGGAAGAAGACCCGCTGTCTG
CGAAGGCCTAAAGGTTAGAAATACAGCAATTTTTCTCATGCCGAAGGAAAGGCAAAATGGTCTAAGAAACTGCAAGAGTGTGAT 
ACAAGTTTTCCTCAACAGCGAGCTGGGACTGGCGACAATTTCTTAACTCACAGCCTCCGCAGTCGCCTTAGTCGACAAAGAGAG 
CAGCCCACGTAGCGGCCGCCGCGCCCGTGCCCGTGGCCTTTACTTGCTTTTGCCCTGAAGTTCAGTGTAGTGGTTTGTTTGGGG 
5 CTTGCAGGAAGAAACAAACACACTCCCTCTGACTCTCCACAACCCATCTTTGCCTTCTGCAACCGCTTCTCCTGCCCATCGCTG 
GTTTTGGGGTAGGCTCTTTGTCTGCCTGAGGCCCCGCACTCGCAGGGCAAGAAGCTTAAAGGAATCCAACTCCAAACTCCTCCG 
AAGTTAAATATTGACCAGAACCTCTTTCAACAGGCCGCGGTTGCCAGGAACCGTGGAGGGCCAACTCCTCCCAACCGCCCTGGT 
GCAAAGTCCCACGCGGCGAAGAGTTTTGGAGCAGCGCTTACCTAGAAAGATGTTTAAATTCTGAACCAGGAATTGTCTCCAACT 
CCAGGCGCTCAGGGAATCGCCTTTTCCGGTGTCCAGGCGCTCTGCAGACAAATAAACAGCAGAAGGTGAATGGCGGCTGCGAGG 
10 GAAGGGAGTTGGAGGCCGGGAGTGGGAAGCGAGGCTCATGCGGGCGCCAAGGTGGCCCTGGGGGTGACGCGAACATCAGGGAGT 
CACCAGGGCACTCGTCCGTGTGGCGCACTCATTGCTCGAAATGTTCCAGTCTACAAATGGTCCGAAGATGGTGATTTATTTTTG 
TAAAATAAATACATACTCCATCTCCACACCC 

106 

TTTTTTTTCCTAGTTTCATTAAAGATGGAGTTC 
15 CACCCATTAAGTTAAAAACTGGAATTTTACCTTTCCAAACTTATTTCCTACCTTGCCCCTTTTCATGTTAATACCATGAACTAT
TCAGTAATTCTCTTCAAGCTCCATAATTTCCTACTTTCTACTTTTGCTCATTATTTCCATTCCAAGGGAAAGTTCTGTGTGCGT 
GTCTATTTCTATGTTCAAATTACACCTGTAAGATCCAACTCACAGGGCCATTGCCCTCTAGGGGTCTCTCCTGGATCCACTTCA 
ACAGGAGGTAATTTTATCTNCACCGGATACCATTTCCTCCTTGGACACTTACCTCTTTTMGGGGACTCCCGAGGGTTGAGGGG 
GNG 

20 108 

GCTTACCTGGAAAACTGTAACAATCTCCAGGTTGACCTTTCTGTCTCCCTCTAAACACATCACATTGGCTAAAATTATCTTTTG 
TTAATAAGGATTATATTACATATAAAAGTATTTAAATATTTTAATGACTCTTACCCTTAGGATAGGGTCTGAAOTCCTTAGCAG 
GCTTCATGATATATTAACCTTGTTGACCTCTTTGGTCTCTCTCAGTACTTCTCAATTCATACTTTGCTCTTCAGCCACAGTAAG 
TTAGTTTTAGTTCCCTGAGTATGTTATCAGCTCTCTGGATTCTGGGCTTTGCTCATATTTCCTCTATTCAAAATACTCTTATAC 
25 TTCATGTCTTTCTTGCCACACCCCCCACCCCTTCCCACCAGCATATGTGTATGTATCACAATTACTCATCCTTCAGATCAGATT 
CAGTTTAAATGGCATTTCTTTGGGAAGCATTTCCTGATTCTCTACCTCCCAGGCTGGGTTGATTGCATGTATAGCACCTTCTAC 
TGTAGTACCCTTTTCATGGCATTTACTTACCATAGCCTATTTGTTATTGCCCTCAAGCTCCATTAGGGCAAATTGTTTTTTCAT 
TGTTTTAAAGTCAGCACCTAAATCTCCCTGGCC 

110 

30 TCTCTACAMTMCTATMCAAMCTTTT^ 

AAAGCTGTGAGTTTCTTCTTACATAAAACTGGTACCTAAGCAAGTGAGGGTTCATTTTATTTTTCACTCACCAATCCCCATATC 
ATTATACAGTAACACCATACAGCCAAAACGGCCATGATATTCCTCCCTTCTCAGCCAAAATTGGGCAAGAGAGAATGACCCTTG 
TAGGGGAAMGAAACCTCTACGATAMCTGAAATGCCACCATC 
TAGCTGCTAATGAGTTTCTATGGTTCCAGNTGGGGTAGTTCAAAGTA 

35 112 

TAAAAAGCATTAGGCATATAAATGTATAAATATATTTTATCATGTACAGTACAAAAATGGAACCTTATGCATGGGCCTTAGGAA 
TACAGGCTAGTATTTCAGCACAGACTTCCCTGCTTGAGTTCTTGCTGATGCTTGCACCGTGACAGTGGGCACCAACACAGACGT 
GCCACCCAACCCCCTGCACACACCACCGGCCACCAGGGGCCCCCTTGTGCGCCTTGGCTTTATAACTCCTCTGGGGGTGATATT 
GGTGGTGATCACAGCTCCTAGCATAATGAGAGTTCCATTTGGTATTGTCACACGTCTCCTGCCTCGCTTGGGTTGCCATGTTTG 

40 AGCGATGGCCCTGTTGATTTCACCCTGCCTTTTACTGAATCTGTAAATTGTTGTGCAATTGTGGTTATAGTAGACTGTAGCACA 
TTGCCTTTTCTAAACTGCTACATGTTTATAATCTTCATTTTTAAAGTATGTGTAATTTTTTTAAGTATGTATTCTATTCATATG 
GTCTGCTTGTCAGTGAGCCAGACTTGCTTACTATATTCCTTTATAATAATGCTAGCCACTTCCTGGATTCTTTAGTAATGTGCT 
GCATGCAAGAACTTTCCAGTAGCAGTGAAGGAGGGCTGCCTCTCCAAGCTTCCTAAGGGATGCTGCCCTGTGTGGGGATGCATT 
GCAGAGGCACTAGTAGCATGGGGGCTAGAGTGGGGAGCGAGATGTAAAAGGGTGGGGGGATAGGAGAATTCCAGAGTGCTTCCA 

45 GCATTAGGGTCCTGAGAACTTCTGAGTTCAGAGAAACATGCAAAGTGACTAACAAAATAGCTACTTACCTTTGCAGTTCTACAG 
ACCCTGGGAGCTGCTTTGGGAGTGAGAAAGGCAACCCTCCAATGTGTTTCAACTTTAAAATGTTGAATTCTTTTCAGACATGGT 
ATCTCATTTATTCTCCTTTTCTAGCGTTTGTTGAATTTCAGGCAGAATGTCTTACAGACTGTCCTAGAACCAGATTATCATTTA 
ATCTGAAACAGCTGAGGAAGGGACAGAGAAGGTACAAGGGQAAGGCAGCACAAAACAGATCAGGAGAATGAAGAGGGAATGCTT 
TGGTTTTTTGTTTTGTTTTGTTTTTTCTTTTTCAAGTAACTAAAACAACATCTACATGTAGAGTGTTGTGGAGAGCTGAGACCA
GGGTAAAGTCAAGTGCAGCATCAGTACTGCGAGACCCACCAGCCCCTGGAGAGGGTCAGCCGAGAATCTGGTAGTGAAGCCTGT 
CTAGGGTCCCGGCACCCTCACCCTCAGCCACCTGCAGAGAGGCCAGGGCCCCAGAGACTAGCCTGGTTCTGAAGTGGGCAGGGG 
TGCTGCCAGAGCCCTCTGCCCCTTATGTTGAGACCCTGCTTTCAGGACAGGCCAGCCGTTGGCCACCATGTCACATTCTGAGTG 
5 AGTGTCACAGGTCCCTAACAATAATTTTCTGATCTGGAGCATATCAGCAGAATGCTTAGCCTCAAGGGGCCTGGAAGCTGTAAT 
GTTTGATTTATGATGAGAACTATCCGAGGCCACCCTTGGCCTCTAAATAAGCTGCTCTAGGGAGCCGCCTACTTTTTGATGAGA 
AATTAGAAGAGTACCTAATGTTGAAAACATGACATGCGCTCTTGGGATCTGCTGTTCTCTCCAGGGCTCCAGAACCTGATACCT 
GTTACCAAAGCTAGGAAAGAGCTTTATCACAAGCCTTCACTGTCCTGGCATGAGAACTGGCTGCCAGGCTCAGTGTACCCCATT 
AACTGTGAATGAATCTGAGCTTGGTTTCCTTTATTGCTTCCTCTGCAATATGATTGCTGAAACACATTTTAAAAATTCAGAAGC 

10 TTGTCACTCCTGTTAATGGGAGGATCAGTCACACATGTGTAGTACAAGGCGGACTTTGTGTTTGTTTTTGGTGTTAATTTTTAG 
CATTGTGTGTGTTGCTTCCCCACCCTGAGGAGAGGACACCATGGCTTACTACTCAGGACAAGTATGCCCCGCTCAGGGTGTGAT 
TTCAGGTGGCTTCCAAACTTGTACGCAGTTTAAAGATGGTGGGGACAGACTTTGCCTCTACCTAGTGAACCCCACTTAAAGAAT 
AAGGAGCATTTGAATCTCTTGGAAAAGGCCATGAAGAATAAAGCAGTCAAAAAGAAGTCCTCCATGTTGGTGCCAAGGACTTGC 
GAGGGGAAATAAAAATGTTATCCAGCCTGACCAACATGGAGAAACCCCGTCTCCATTAAAAATACAAAATTAGCCTGGCATGGT 

15 GGCGCATGCCTGTAATCCCAGCTACTCTGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGCGGAGGTCGCAGTGAGCCG 
AGATCATGCCAGTGCACTCCAGCCTGGGTAACAAGAGTGAAACTCCGTGTCAAAAAAAAAAAAAAAAAATGTTACTCATCCTCT 
CTGAAAGCAAAAAGGAAACCCTAACAGCTCTGAACTCTGGTTTTATTTTTCTTGCTGTATTTGGGTGAACATTGTATGATTAGG 
CATAATGTTAAAAAAAAAAAATTTTTTTTTGGTAGAAATGCAATCACCAGTAAAGAGGTACGAAAAAGCTAGCCTCTCTCAGAG 
ACCGGGGAGGCAGAGTACTACTAGAGGAAGTGAAGTTCTGATGGAATCATGCCTGTCAAATGAGGTCTTGAAGCGGATGCCCAA 

20 ATAAAAGAGTATATTTTATCTAAATCTTAAGTGGGTAACATTTTATGCAGTTTAAATGAATGGAATATTTTCCTCTTGTTTAGT 
TGTATCTGTTTGTATTTTTCTTTGATGAATGATTGGTCATGAGGCCTCTTGCCACACTCCAGAAATACGTGTGCGGCTGCTTTT 
AAGAACTATGTGTCTGGTCACTTATTTCTCTAAAATTATCTCATTGCCTGGCAATCAGTCTTCTCTTGTATACTTGTCCTAGCA 
CATTATGTACATGGGAAATGTAAACAAATGTGAAGGAGGACCAGAAAAATTAGTTAATATTTAAAAAAATGTATTGTGCATTTT 
GGCTTCACATGTTTAACTTTTTTTAAGAAAAAAGTTGCATGAATGGAAAAAAAAATCTGTATACAGTATCTGTAAAAACTATCT 

25 TATCTGTTTCAATTCCTTGCTCATATCCCATATAATCTAGAACTAAATATGGTGTGTGGCCATATTTAAACACCTGAGAGTCAA 
GCAGTTGAGACTTTGATTTGAAGCACCTCATCCTTCTTTCAATGCGAACACTATCATATGGCATTCTTACTGAGGATTTTGTCT 
AACCATATGTTGCCATGAATTAACTCTGCCGCCTTTCTTAAGGATCAAAACCAGTTTGATTTGGGAATCTTCCCCTTTCCAAAT 
GAAATAGAGATGCAGTACTTAACTTTCCTTGGTGTTTGTAGATATTGCCTTGTGTATTCCACTTAAAACCGTAATCTAGTTTGT 
AAAAGAGATGGTGACGCATGTAAATAAAGCATCAGTGACACTCT 

30 113 

MGTDSRAAKALLARARTLHLQTGNLLM^ 

EREEMKVSAKLFIVESNSSSSTRSAVDMACSVLGVAQLDSVIIASPPIEDGVNLSLEHLQPYWEELENLVQSKKIVAIGTSDLD 
KTQLEQLYQWAQVKPNSNQVNLASCCVMPPDLTAFAKQFDIQLLTHNDPKELLSEASFQEALQESIPDIQAHEWVPLWLLRYSV 
IVKSRGIIKSKGYILQAKRRGS 

35 114 

GGCACGAGGCTGCGGCCGCAGTAGCCGGAGCCGGAGCCGCAGCCACCGGTGCCTTCCTTTCCCGCCGCCGCCCAGCCGCCGTCC 
GGCCTCCCTCGGGCCCGAGCGCAGACCAGGCTCCAGCCGCGCGGCGCCGGCAGCCTCGCGCTCCCTCTCGGGTCTCTCTCGGGC 
CTCGGGCACCGCGTCCTGTGGGCGGCCGCCTGCCTGCCCGCCCGCCCGCAGCCCCTTGCCTGCCGGCCCCTGGGCGGCCCGTGC 
CATGGGCACCGACAGCCGCGCGGCCAAGGCGCTCCTGGCGCGGGCCCGGACCCTGCACCTGCAGACGGGGAACCTGCTGAACTG 

40 GGGCCGCCTGCGGAAGAAGTGCCCGTCCACGCACAGCGAGGAGCTTCATGATTGTATCCAAAAAACCTTGAATGAATGGAGTTC
CCAAATCAACCCAGATTTGGTCAGGGAGTTTCCAGATGTCTTGGAATGCACTGTATCTCATGCAGTAGAAAAGATAAATCCTGA 
TGAAAGAGAAGAAATGAAAGTTTCTGCAAAACTGTTCATTGTAGAATCAAACTCTTCATCATCAACTAGAAGTGCAGTTGACAT 
GGCCTGTTCAGTCCTTGGAGTTGCACAGCTGGATTCTGTGATCATTGCTTCACCTCCTATTGAAGATGGAGTTAATCTTTCCTT 
GGAGCATTTACAGCCTTACTGGGAGGAATTAGAAAACTTAGTTCAGAGCAAAAAGATTGTTGCCATAGGTACCTCTGATCTAGA 

45 CAAAACACAGTTGGAACAGCTGTATCAGTGGGCACAGGTAAAACCAAATAGTAACCAAGTTAATCTTGCCTCCTGCTGTGTGAT 
GCCACCAGATTTGACTGCATTTGCTAAACAATTTGACATACAGCTGTTGACTCACAATGATCCAAAAGAACTGCTTTCTGAAGC 
AAGTTTCCAAGAAGCTCTTCAGGAAAGCATTCCTGACATTCAAGCGCACGAGTGGGTGCCGCTGTGGCTACTGCGGTATTCGGT 
CATTGTGAAAAGTAGAGGAATTATCAAATCAAAAGGCTACATTTTACAAGCTAAAAGAAGGGGTTCTTAACTGACTTAGGAGCA 
TAACTTACCTGTAATTTCCTTCAATATGAGAGAAAATTGAGATGTGTAAAATCTAGTTACTGCCTGTAAATGGTGTCATTGAGG 

50 CAGATATTCTTTCGTCATATTTGACAGTATGTTGTCTGTCAAGTTTTAAATACTTATCTTGCCTCCATATCAATCCATTCTCAT 
GAACCTCTGTATTGCTTTCCTTAAACTATTGTTTTCTAATTGAAATTGTCTATAAAGAAAATACTTGCAATATATTTTTCCTTT
ATTTTTATGACTAATATAAATCAAGAAAATTTGTTGTTAGATATATTTTGGCCTAGGTATCAGGGTAATGTATATACATATTTT 
TTATTTCCAAAAAAAATTCATTAATTGCTTCTT 

TGGAAGAAGATATTTTTCCTGTCATTGATGAGATATATCAGAGTAACTGGAGTAGCTGGGATTTACTAGTAGTGTAAATAAAAT 
5 TCACTCTTCAATAC 

115 

MTAIIKEIVSRNKRRYQEDGFDLDLTYIYPNIIMGFPAERLEGWRTO 

VAQYPFEDHNPPQLELIKPFCEDLDQWLSEDDNHVAAIHCKAGKGRTGVMICAYLLHRGKFLKAQEALDFYGEVRTRDKKGVTI 
PSQRRYVYYYSYLLKNHLDYRPVALLFHKMMFETIPMFSGGTCNPQFVVCQLKVKIYSSNSGPTRREDKFMYFEFPQPLPVCGD
10 IKVEFFHKQNKMLKKDKMFHFWWOT 

FSPNFKVKLYFTKTVEEPSNPEASSSTSVTPDVSDNEPDHYRYSDTTDSDPENEPFDEDQHTQITKV 

116 

CCTCCCCTCGCCCGGCGCGGTCCCGTCCGCCTCTCGCTCGCCTCCCGCCTCCCCTCGGTCTTCCGAGGCGCCCGGGCTCCCGGC 
GCGGCGGCGGAGGGGGCGGGCAGGCCGGCGGGCGGTGATGTGGCAGGACTCTTTATGCGCTGCGGCAGGATACGCGCTCGGCGC 

15 TGGGACGCGACTGCGCTCAGTTCTCTCCTCTCGGAAGCTGCAGCCATGATGGAAGTTTGAGAGTTGAGCCGCTGTGAGGCGAGG
CCGGGCTCAGGCGAGGGAGATGAGAGACGGCGGCGGCCGCGGCCCGGAGCCCCTCTCAGCGCCTGTGAGCAGCCGCGGGGGCAG 
CGCCCTCGGGGAGCCGGCCGGCCTGCGGCGGCGGCAGCGGCGGCGTTTCTCGCCTCCTCTTCGTCTTTTCTAACCGTGCAGCCT 
CTTCCTCGGCTTCTCCTGAAAGGGAAGGTGGAAGCCGTGGGCTCGGGCGGGAGCCGGCTGAGGCGCGGCGGCGGCGGCGGCGGC 
ACCTCCCGCTCCTGGAGCGGGGGGGAGAAGCGGCGGCGGCGGCGGCCGCGGCGGCTGCAGGTCCAGGGAGGGGGTCTGAGTCGC 

20 CTGTCACCATTTCCAGGGCTGGGAACGCCGGAGAGTTGGTCTCTCCCCTTCTACTGCCTCCAACACGGCGGCGGCGGCGGCGGC 
ACATCCAGGGACCCGGGCCGGTTTTAAACCTCCCGTCCGCCGCCGCCGCACCCCCCGTGGCCCGGGCTCCGGAGGCCGCCGGCG 
GAGGCAGCCGTTCGGAGGATTATTCGTCTTCTCCCCATTCCGCTGCCGCCGCTGCCAGGCCTCTGGCTGCTGAGGAGAAGCAGG 
CCCAGTCGCTGCAACCATCCAGCAGCCGCCGCAGCAGCCATTACCCGGC.TGCGGTCCAGAGCCAAGCGGCGGCAGAGCGAGGGG 
CATCAGCTACCGCCAAGTCCAGAGCCATTTCCATCCTGCAGAAGAAGCCCCGCCACCAGCAGCTTCTGCCATCTCTCTCCTCCT 

25 TTTTCTTCAGCCACAGGCTCCCAGACATGACAGCCATCATCAAAGAGATCGTTAGCAGAAACAAAAGGAGATATCAAGAGGATG 
GATTCGACTTAGACTTGACCTATATTTATCCAAACATTATTGCTATGGGATTTCCTGCAGAAAGACTTGAAGGCGTATACAGGA 
ACAATATTGATGATGTAGTAAGGTTTTTGGATTCAAAGCATAAAAACCATTACAAGATATACAATCTTTGTGCTGAAAGACATT 
ATGACACCGCCAAATTTAATTGCAGAGTTGCACAATATCCTTTTGAAGACCATAACCCACCACAGCTAGAACTTATCAAACCCT 
TTTGTGAAGATCTTGACCAATGGCTAAGTGAAGATGACAATCATGTTGCAGCAATTCACTGTAAAGCTGGAAAGGGACGAACTG 

30 GTGTAATGATATGTGCATATTTATTACATCGGGGCAAATTTTTAAAGGCACAAGAGGCCCTAGATTTCTATGGGGAAGTAAGGA 
CCAGAGACAAAAAGGGAGTAACTATTCCCAGTCAGAGGCGCTATGTGTATTATTATAGCTACCTGTTAAAGAATCATCTGGATT 
ATAGACCAGTGGCACTGTTGTTTCACAAGATGATGTTTGAAACTATTCCAATGTTCAGTGGCGGAACTTGCAATCCTCAGTTTG 
TGGTCTGCCAGCTAAAGGTGAAGATATATTCCTCCAATTCAGGACCCACACGACGGGAAGACAAGTTCATGTACTTTGAGTTCC 
CTCAGCCGTTACCTGTGTGTGGTGATATCAAAGTAGAGTTCTTCCACAAACAGAACAAGATGCTAAAAAAGGACAAAATGTTTC 

35 ACTTTTGGGTAAATACATTCTTCATACCAGGACCAGAGGAAACCTCAGAAAAAGTAGAAAATGGAAGTCTATGTGATCAAGAAA 
TCGATAGCATTTGCAGTATAGAGCGTGCAGATAATGACMGGAATATCTAGTACTTACTTTAACAAAAAATGATCTTGACAAAG 
CAAATAAAGACAAAGCCAACCGATACTTTTCTCCAAATTTTAAGGTGAAGCTGTACTTCACAAAAACAGTAGAGGAGCCGTCAA 
ATCCAGAGGCTAGCAGTTCAACTTCTGTAACACCAGATGTTAGTGACAATGAACCTGATCATTATAGATATTCTGACACCACTG 
ACTCTGATCCAGAGAATGAACCTTTTGATGAAGATCAGCATACACAAATTACAAAAGTCTGAATTTTTTTTTATCAAGAGGGAT 

40 AAAACACCATGAAAATAAACTTGAATAAACTGAAAATGGACCTTTTTTTTTTTAATGGCAATAGGACATTGTGTCAGATTACCA 
GTTATAGGAACAATTCTCTTTTCCTGACCAATCTTGTTTTACCCTATACATCCACAGGGTTTTGACACTTGTTGTCCAGTTGAA
AAAAGGTTGTGTAGCTGTGTCATGTATATACCTTTTTGTGTCAAAAGGACATTTAAAATTCAATTAGGATTAATAAAGATGGCA 
CTTTCCCGTTTTATTCCAGTTTTATAAAAAGTGGAGACAGACTGATGTGTATACGTAGGAATTTTTTCCTTTTGTGTTCTGTCA 
CCAACTGAAGTGGCTAAAGAGCTO? i rGTGATATACTGGTTCACATCCTACCCCTTTGCACTTGTGGCAACAGATAAGTTTGCAGT 

45 TGGCTAAGAGAGGTTTCCGAAAGGTTTTGCTACCATT.CTAATGCATGTATTCGGGTTAGGGCAATGGAGGGGAATGCTCAGAAA 
GGAAATAATTTTATGCTGGACTCTGGACCATATACCATCTCCAGCTATTTACACACACCTTTCTTTAGCATGCTACAGTTATTA 
ATCTGGACATTCGAGGAATTGGCCGCTGTCACTGCTTGTTGTTTGCGCATTTTTTTTTAAAGCATATTGGTGCTAGAAAAGGCA 
GCTAAAGGAAGTGAATCTGTATTGGGGTACAGGAATGAACCTTCTGCAACATCTTAAGATCCACAAATGAAGGGATATAAAAAT 
AATGTCATAGGTAAGAAACACAGCAACAATGACTTAACCATATAAATGTGGAGGCTATCAACAAAGAATGGGCTTGAAACATTA 

TAAAAATTGACAATGATTTATTAAATATGTTTTCTCAATTGTAAAAAAAAAA
TTCTATCAGAAATTGTATTTATTTACCAGCATTTATGGCAAATGGCCAGACCTGGGCAGGGTGGTAGCAATGATAGATTTTACA
AATTAAGGCATTGTTCTTTATTTCAAANTTTTCATTTACCNATATTTTCCTTGGTCTTAGAAATATGAATANTTTAGAGCTGGC 
AATACTCACCATCAGGATATAATAAACGGAGGTTTCTTTNTCCGAAATCCATAAANTGTAGTANTACTCTNTTGTACTTTTAAA 
5 ATTCCNATTTTTGCAGTNGGCTTCCTCTCAGTGATTTAGTTAGGNAGTTTTGGTACATTTTGGNGGGGTCATAAACATGTTCAT 
AGGATAGGAGTACNGGGCATTTNTGGAAAAAC CCG 

119 

MERVKMIWQRLLEAAEFLERRERECEHGYASSFPSMPSPRLQHSKPPRRLSRAQKHSSGTSNTSTANRSTHNELEKNRRAHLR 
LCLERLKVLIPLGPDCTRHTTLGLLNKAKAHIKKLEEAERKSQHQLENLEREQRFLKWRLEQLQGPQEMERIRMDSIGSTISSD 
10 RSDSEREEIEVDVESTEFSHGEVDNISTTSISDIDDHSSLPSIGSDEGYSSASVKLSFTS 

120 

AGATTATGATCGCCTGAGGCCCCTCTCCTACCCAGATACCGATGTTATACTGATGTGTTTTTCCTTTTTTTTTTTTTTTTTTTA 
AGTAATTAAGGGTAGTTAAATTATTTAAAGTATACAAAGTCCAAACAGCCAGGGGTAAGGTCTCCAAGAGGCCTTCCCAGGGTA 
AGGGAGTGCGGAGAGGCCCCGGTCGCCACCCGCGGTGCCCATGGAGCGGGTGAAGATGATCAACGTGCAGCGTCTGCTGGAGGC 

15 TGCCGAGTTTTTGGAGCGCCGGGAGCGAGAGTGTGAACATGGCTACGCCTCTTCATTCCCGTCCATGCCGAGCCCCCGACTGCA 
GCATTCAAAGCCCCCACGGAGGTTGAGCCGGGCACAGAAACACAGCAGCGGGACGAGCAACACCAGCACTGCCAACAGATCTAC 
ACACAATGAGCTGGAAAAGAATCGACGAGCTCATCTGCGCCTTTGTTTAGAACGCTTAAAAGTTCTGATTCCACTAGGACCAGA 
CTGCACGCGGCACACAACACTTGGTTTGCTCAACAMGCCAMGGACACATCAAGAAACTTGAAGAAGCTGAAAGAAAAAGCCA
GCACCAGCTCGAGAATTTGGAACGAGAACAGAGATTTTTAAAGTGGCGACTGGAACAGCTGCAGGGTCCTCAGGAGATGGAACG 

20 AATACGAATGGACAGCATTGGATCAACTATTTCTTCAGATCGTTCTGATTCAGAGCGAGAGGAGATTGAAGTGGATGTTGAAAG 
CACAGAGTTCTCCCATGGAGAAGTGGACAATATAAGTACCACCAGCATCAGTGACATTGATGACCACAGCAGCCTGCCGAGTAT 
TGGGAGTGACGAGGGTTACTCCAGTGCCAGTGTCAAACTTTCATTCACTTCATAGAACCCAGCATGACATAACAGTGCAGGGCA 
AAATATTCACTGGGCCAATTCAATACAAACAATCTCTTAAATTGGGTTCATGATGCAGTCTCCTCTTTAAAACAAAACAAAACA 
AAACAAAACTATACTTGAACAAAAGGGTCAGAGGACCTGTATTTAAGCAAATACTTAGCAAAAAGTGGGGCAGAGCTCCCAAGG 

25 AGAACAMTATTCAGAATATTCATATTGGAAAAATCACAATTTTTMTGGCAGCAGAAAACTTGTGTGAAATTTTCTTGATTTG 
AGTTGATTGAGAAGAGGACATTGGAGATGCCATCCTCTTTCTCTTTTCTCGTTTGCTCATACTACATTGAGTAGACACATTTAA 
GGATGGGGTTATGAACCCTTCCTGAGCTTTATGGTCCTAAAAGCAAAATAAAAACTATTCGAATGAAAAGACAAGAAAATCAGG 
TATTAATCTTGGATAGCTAATAATGAGCTATTAAAACTCAGCCTGGGACAGTTTATCATGAAGCCTGTGGATGATCAATCCTTT 
ATTATTATTTTTTTTTTTTGAAAAAAGCTCATTTCATGCTCTGCAAAAGGAGAGACTCCCATGAAGCCTTTTGAAAGGGATCAT 

30 CATGCAGCTCAACTTTCTGTTGGATTCCATGCTAAGCAAGCTAACCTTATCCTGCATTGTTAGCACTAGGCACCCAGCTGCCAC 
CTCTCCATCCTGCTGCCCTTAGGCCACATGGGAGCAGTCCATGCATGACAGCCTCTATCCTACAAGGCCTATGAGTATGGATTG 
GGGGGGCCAAAAGGAAAAAGCTCCATGTGCCTCTTTGTCTGCGTGGGTCAGAAGAGTTGTGCACGCAGATTAGCAGGCCAAGGT 
CTGAGCCACAGCAGCATTTTTATTTCAGATTTTGATAACTGTTTATATGTGTTGAAAACCAAAATGACATC.TTTTTAAAGCTTA
TCCATAAAAAAAAATAGATGTCTTTTATAGTGGAAAAACACATGGGGAAAAAAATCATCTATTTTGATGCAGCATTTGATAATG 

35 ATAAAACACCTCACACCTCACTCTTTATAGTGCACAAAATGAATGAGGTCTGGGCTAGGTAGAAAAAGGGTCAATGCTATTTTT 
GTTTTTAGAATCATTACCTTTTACCAGCTTTTAACCATCTGATATCTATAGTAGACACACTATCATAGTTAACATAGTTAAGTT 
CAGCACTTGTCTCATTTTAATGTAAAGATTTGCTTCCATTTTCCTACAGGCAGTCTCTCTCTTCCTCACAGTCCCACTGTGCAG 
GTGCTATTGTTACTCTTACGAATATTTTCAGTAATGTTATTTTCTTCTAAGTGAAATTTCTAGCCTGCACTTTGATGTCATGTG 
TTCCCTTTGTCTTTCAAACTCCAAGGTTCCCCTGTGGCCCTCTCCCTTACCCTGGGAAGGCCTCTTGGAGACCTTACCCCTGGC 

TGTTTGGACTTTGTATACTTTAAATAATTTAACTACCCTTAATTACTTAAAAAAAAAAAAAAAA

121 

MNSNFITFDLKMSLLPSNLFSAFITLCFGAIFFLPDSSKLLSGVLFHSSPALQPAADHKPGPGARAEDAAEGRARRREEGAPGD
PEAALEDNLARIRENHERALREAKETLQKLPEEIQRDILLEKKKVAQDQLRDKAPFRGLPPVDFVPPIGVESREPADAAIREKR 
AKIKEMMKHAWNNYKGYAWGLNELKPI SKGGHS S SLFGNIKGATI VDALDTLF IMEMKHEFEEAKS WVEENLDFNVNAEI SVFE 
45 WIRFVGGLLSAYYLSGEEIFRKKAVELGVKLLPAFHTPSGIPWALLMKSGIGRMPWASGGSSILAEFGTLHLEFMHLSHLS 
GNPIFAEKVMNIRTVLNKLEKPQGLYPNYLOT^ 

IRKSSSGLTYIAEWKRGLLEHKMGHLTCFAGGMFALGADAAPEGMAQHYLELGAEIARTCHESYNRTFMKLGPEAFRFDGGVEA 

IATRQNEKYYILRPEVMETYMYMRLTHDPKYRKWAWEAVEALENHCRWGGYSGLRDVYLLHESYDDVQQSFFLAETLKYLY 

IFSDDDLLPLEHWIFNSEAHLLPILPKDKKEVEIREE 
122

CCAACTTATTTAAAACAAAACAATTTTGTAGGTATTATTATACCCATTTCACAGATGATGATAAATGAGACCAATAGAAGTTAA 
ATAACTTGCCAAAGGCCACACAGCTGGTGAGTGATGGAGAACGAATTAAAACTCAAGTGAGCATAATTCTAAAAGCCATCTTCT 
CGTTAGTGTTTCTCACTATCCAGGTCTGCCTTTGCCTTATTTAACTGAAGTTAAGCCATCCTTACCTGTGATCACCTAGCCTCT 
5 CAGTTTGGGGGGATCATTACAGCGGGTTTTTAACTCCCAATGTTCTGGTCCAGTTTGCTTTACATGTTCTTATTTATACATTGT 
CAAGGATGACCTCAGGACAGTACAGCAAGGACACAGTGGCACTTCACATTTTGTTCCCACGAAATGACTGGGGCATAATCTCAG 
ATCATCTTCCTTTAGAATGTGGAAACATCAGCAGAAGAATATTAGTCTTTATACAAGTCAMTCCAAAATGACACATGTGAAAA 
CTAATAGAGCTGACTTTCAGCCATGATAGCTTTGGCACACCTCACATCCCTTTGTTCAACCTCTCTTCCCTCAACGGAGAGCTG 
CATTCCTGGGAATTTCTGTTGTGCACTTTTCCCACTTGCCCTGCTGTCATTTAAAGGTGAACATTCTAGTTTTGCTAAGAAAAC 

10 CCTTTCCTTCATTTGGAATGAACAGCAATTTTATTACTTTTGACCTTAAAATGAGTTTGCTGCCTTCAAATCTTTTCAGCGCCT 
TCATCACGCTCTGCTTCGGGGCGATCTTCTTCCTGCCAGACTCCTCCAAGCTGCTCAGCGGGGTCCTGTTCCACTCCAGCCCCG 
CCTTGCAGCCGGCCGCCGACCACAAGCCCGGGCCCGGGGCGCGCGCCGAGGACGCGGCCGAGGGGCGAGCCCGGCGCCGCGAGG 
AGGGGGCACCCGGGGACCCGGAGGCCGCCCTGGAGGACAACTTGGCCAGGATCCGCGAAAACCACGAGCGGGCTCTCAGGGAAG 
CCAAGGAGACCCTGCAGAAGCTGCCCGAGGAGATCCAAAGAGACATCCTACTGGAGAAGAAGAAGGTGGCCCAGGACCAGCTGC 

15 GTGACAAGGCGCCGTTCAGAGGCCTGCCCCCGGTGGACTTCGTGCCCCCAATCGGGGTGGAGAGCCGGGAGCCCGCCGACGCCG 
CCATCCGCGAGAAAAGGGCAAAGATCAAAGAGATGATGAAACATGCTTGGAATAATTATAAAGGTTATGCCTGGGGATTAAATG 
AACTCAAACCTATATCAAAAGGAGGCCATTCAAGCAGTTTGTTTGGTAACATCAAAGGAGCAACTATAGTAGATGCCCTGGATA 
CACTTTTTATTATGGAAATGAAACATGAATTTGAAGAAGCAAAATCATGGGTTGAAGAAAATTTAGATTTTAATGTGAATGCTG 
AAATTTCTGTCTTTGAAGTAAATATACGCTTTGTTGGTGGACTACTCTCAGCCTACTATCTGTCTGGAGAAGAGATTTTTCGAA 

20 AGAAAGCAGTGGAACTTGGGGTAAAATTGCTACCTGCATTTCATACTCCCTCTGGAATACCTTGGGCATTGCTGAATATGAAAA 
GTGGTATTGGAAGGAACTGGCCCTGGGCCTCTGGAGGCAGCAGTATTCTGGCAGAATTTGGAACCCTGCATTTGGAGTTTATGC 
ACTTGAGCCACTTATCAGGAAACCCCATCTTTGCTGAAAAGGTAATGAATATTCGAACAGTACTGAACAAACTGGAAAAACCAC 
AAGGCCTTTATCCTAACTATCTGAATCCCAGTAGTGGACAGTGGGGTCAACATCATGTATCAGTTGGAGGACTTGGAGACAGCT 
TCTATGAGTATTTGCTGAAGGCCTGGTTAATGTCTGACAAGACAGATCTGGAAGCTAAGAAGATGTATTTTGATGCTGTTCAGG 

25 CTATCGAGACTCATTTGATCCGCAAGTCTAGCAGCGGACTAACTTATATCGCAGAGTGGAAAAGGGGCCTCCTGGAGCACAAGA 
TGGGCCACCTGACCTGCTTCGCGGGGGGCATGTTCGCACTCGGGGCTGATGCAGCTCCCGAAGGCATGGCCCAACACTACCTTG 
AACTCGGGGCTGAAATTGCCCGTACTTGTCATGAATCATATAATCGAACATTTATGAAACTGGGACCAGAAGCTTTCAGATTTG 
ATGGTGGTGTTGAAGCCATCGCTACAAGACAAAATGAAAAATACTACATCTTACGGCCAGAAGTTATGGAGACTTACATGTATA 
TGTGGAGACTGACTCATGATCCAAAGTACAGGAAATGGGCCTGGGAAGCCGTAGAGGCCTTGGAAAACCATTGCAGAGTGAATG 

30 GAGGCTATTCAGGCCTAAGGGATGTTTACCTTCTTCATGAGAGTTATGATGATGTGCAGCAGAGTTTCTTCCTGGCAGAGACAT 
TGAAATATTTGTACCTAATATTTTCTGACGACGATCTTCTTCCACTGGAGCATTGGATCTTCAATAGCGAGGCACATCTTCTCC 
CTATCCTCCCTAAAGATAAAAAGGAAGTTGAAATCAGAGAGGAATAAAAAAGACATTTATATTTTATTCTGCTCCATTCCCTTC 
ACTGTATACCTTAATAATTCCTTTTCTGGTAATCAGGCACATGATGAACTTTGATTAGTAGGTCTGTGATTAAGTTCTTAAATT 
GTTTTGCAGTCTTTTATGTTTATTATCATAGGTATAGGTGGACCTAAATTCCTTATCATATCTTTATTAATTCAGCCAGTGTAT 

35 CCACCAGTTTTTTGTTTATGTTTOT 

AGAATTGTATAGCGATGACCTCTTAATTATAATTTGATTTGACTGCAAAACTTTTTCCTCCTCTAAGAGGAGATGATGTCTGCT 
TTAAGCTGTAATGTTTTGCCATGTTGCAAAAAGCCATAATAATAAGTATAAAAAAGCTTTTTCCTTTACAATTTCATGTTAATC 
TGGTTTGTCTGTCCACCAGAGACAGATCTTCTGTGACAGCCTCCTTATGCAGGTCTATCATTATTTGATAGAATGTCTTCTAAA 
ATACTTCACTCACATTGTAATTCAAATTAGAAAGTCATTCCAAAAGGTCATGTCATGTTGACCTCATTTCATCGGAACTGCAGT 

ATATTTTTGTTGGTTAATTATATTAGTGTTTTCTATTTTGAAAAAAAAAAAAAAAAAA

124 

TTTTGGTCTGAGAATGATGGACATTTAGACACTGGCGCCAGGTTTGCGCTGGACCGGCGCACGCAGGGGTTGGGCGGACGAAAG 
ACACACAGGTGGGCTACAGGTGTCACACGGCACCAGCCAGGGCCCGGGGTGGCTGGGGTGAGGATGGGTGTTTGGCCAGTGACC 
AGGAGTCAGGTCAAGTCCAGGTGGTCAGTGCCAGGGGTCCCAGGAGGGGAGGGCAGTGCATTAACCCTCCTGGTGTCCAGCGTC 
ACCAGGCGGTCGTCACAGAAAGCAACCTCGGCCCGGGGCCG

126 

AATTATATATTACATGTTTATTAAGAGCACAACTTTTAT6TAAMTTO 

CTTGGAAGACAGATGTGCATTGTTCTAATTACAATCCAAAGTAGTAAATAACAATCCTTTAAAACTCACATTTATTAGAGTTGT 
GTTTACAAATTCTTGGTTAAAGAGGCAGCTACAAAGTTTACCACTATATATAAGCAAGAACCAGCTTGCTAGGTACATTTCCCA 
TTGAAAATCTACTGGTCTCTTTTACACCATTAGTGGATTTTTAAATGGAAAAAAAAAATCMTATAMCTCATATGGCTTC^
ATTGTA 

128 

AATATAGTTATCTTCTTAAAAACCATTATAACAATTCAGAGAGAGTTCTTTACAAAGCCATGAATATGMCTATGGGGAATCAT 
5 GGTTCTTTTAAAGCAATTTTCAAAATAAGTACCAATTAAAGCTTTAGGTTCCAAGAAGATTCTGGGACTCAGGAAGAAAAAGTG 
CCATCAGGTGACCAGCTGTTGCATTTCTTGCTTATTCTGTTTTGTTTTTGCACATCATAATGGATTTTTCTTAGTGCCCTAATT 
GTGAAGGGTTTCTCTAGCTTTGGTTATGTGTAATGTTCACGTGACCTTTTTTTGTCAATCGTTTTTGGAATTTTTCTTTCTTTC 
TGTGCTTTATTACTAATAAGTCCAATGAGTGAGTAGTAGCTAGATGACTAGTATGTAGTTTTATATTTTGGTAAAATTATTTGC 
CCTTTCAGAAATGCCTCATCTAAAGATACATGATAATTTTGGAGTTGGAGGGGGCCTTAGAGGCTCTCCAGCTCTGCTTCTTGC 

10 CCATTGCCAAATACTGAAATGGAAGCCCGTCTTACCTGGGGTCACTAACTGGTTGGTTAACTGAGCTAAGAATAGGCTGTGGGT 
CTCCTCACTTGTGGCCCAGTGCTCTTTCTGCTATACAAAATGTCTAATCTCAGATTTTTCTTCTGCTGCTTGACTGGTTCATCT 
GGATGMCTACAAAAAACCCATGATTAAGGTTTATGAATTCAAGTAATAATTAGATTTTTTTTGCACAGACTTACTTAACTTCC 
TTATTGGATATGTTTGTAACACATAAACACAAAGCACTTTTCAAACATGATGCACTTTTATCTTTGTGAATAATTTACTGTCCT 
TTCCTCCTGGGATATGAGAAACATTTTAAAAAACGTATTTMCAGAAGAGAGCAAATAAAGATATATCAGGAAGGATGTATTAG 

15 TTATTTACTTAAATGTTTATAATATCTGGATTTTTTTTGTTTTGTTACTCATAGAACTGGTGTTGTTTGCTGTTTTTATTTCTC 
TAATTGTTGCAGAGTTCTGCCTGTTACAAAGCTACAGAACTGTAT'TGTTTTTATTTTCCTTCTTGAGCACATGTTAACAAACTA 
AGCTTCACATTAGAGTGATGTCATAATGTAAAATGTTTGCATTGTGGTTAGGTATTGAAGTTTATGTCCTGTCTGTGTAAAGAT 
TCATCTTTTATTGTAAATATTTAGACTTTACCACAGAAATATTGGAACAGTTTGCTTTATAAGATTAAAAAGCATCCTTCAGAA 
TGGAGCTTGCCTTGTGCTTAGAMTAATATGTTGMCTATTTTGCAATATACTATTTTAAATCTAAATTCTGTCACTTCGCTGC 

20 CTTCTTAAAATAGTGTGGTATTTCAAATATTGCTAGAGCTATTTTCCTGAAATACATTTGCAAAATAAGGCTGCTTTGTAATCA 
AGGAATATTTTTATTGATTGAAGGAMTGACTGTACTGCGATTCAAAAGTAAACTTATTTTATTATACAGATTATTTCTTAAAA 
ACTCTATTTATACCTTAACATGAAATCCATGACCACACCi^ 

AAGTTAAAAACAGTAATGCCAACATTGAATTTATTTTTGAGGTCAAAGAACCAGTTGTTCTTTTTATATTTAGATGAGGATGAT 
TGAGTCCATATACTATGTATGTTTACATATACTATACATGCACATTAGGTGTTTTCATTTGTGTTTTGCTTATGAAATGTCATT 
TAAAGTTCACTTCTTGAGCATCAATAAAAAGGGAAGCTGTGTGGTTTTGG

129 

MSLWSAGRGLGAVWSPTHVQVTVLQARGLRAKGPGGTSDAYAVIQVGKEKYATSVSERSLGAPVWREEATFELPSLLSSGPAA 
AATLQLTVLHRALLGLDKFLGRAEVDLRDLHSSLGKSFFKTLKKRAWAIFLRLCLKKN 

130 

30 GTGGAGGCCGCCAGTCGCGGCGATCTTCTCCTCGCTTCTGGAGTGTTATCGTCACCATGTCCCTAATGGTCTCGGCTGGCCGGG 
GCCTGGGGGCCGTGTGGTCCCCAACCCACGTGCAGGTGACGGTGCTGCAGGCGCGGGGCCTGCGGGCCAAGGGCCCCGGGGGCA 
CGAGCGACGCGTACGCGGTGATCCAGGTGGGCAAGGAGAAGTACGCCACCTCCGTGTCGGAGCGCAGCCTGGGCGCGCCCGTGT 
GGCGCGAGGAGGCCACCTTCGAGCTGCCATCGCTGCTGTCCTCCGGACCCGCGGCCGCCGCCACCCTGCAGCTCACCGTGCTGC 
ACCGCGCGCTGCTCGGCCTCGACAAGTTCCTGGGCCGCGCCGAGGTGGACCTGCGGGATCTGCACTCCAGCCTGGGCAAGAGTT 

TTTTTAAGACTCTTAAAAAAAGAGCCTGGGCAATTTTT

TTCACTCTACAGGGGACAGGAGACCATGGATTGGACCCCAAAGGGATTGAACTGCATCTGCATGTCTGTCCTTTGAACACTTTC 
TCTCCCTGCCCAAAAGGAAACCCAMTTATTTGTGGGATACTGGGGAAATTGTAGTGAAGGGCTTAATGTAGTTMTAAAAGTT 
AAAAGTCAGTAGAAAACAGGTGCCTCAGCCTTCAAATGGTTGCTTTTTTTCCATTTTCCCTCATGAATAGACTCACCAGCATTT 
TACCCCCTTGTTATAAAACTGTGCAGAGCAAGAAGATC^^ 

TTAAGACTGGAATTAGAATGTTTCTGGTATTTCTCATTTTTTTCCCTGGCTATGATAGAATCTCATCTAATCTTGACATCTCTC 
CTAGGGGAAGAATATCACAGGCTAATAGCGTGGTTGGGGGTGAAGATGATAGCAGTTATTAAATCAGGAATCTCTTTTATGTAT 
GTCCTTGTTACATTGAGGTTAAGAGACAAAATCATTGGCAGTGCAATCTCTTTCCAGGATTTCGTTTGCTGTGGCATTGGTTAT 
ATCAGAGCACTTTAATCTGAAGGATGATACTGTAACTTGATTTATCTAATTAGCTTTTAATTATCTACAGCTATTTTATTTTAT 
TTAATTCTCTCCTACAGTACTGGGACCACTGTAAACTTCTC

CTTATAAAGAAAACTGTATGTTCACTTGAAAAAAAAAAAAAAAAAA 



131 
MCAFPWLLLLLLLQEGSQRRLWRWCGSEEWAVLQESISLPLEIPPDEEVENIIWSSHKSLATWPGKEGHPATIMVTNPHYQG
QVSFLDPSYSLHISNLSWEDSGLYQAQVNLRTSQI^ 

WLSRGDSTYTFHEGPVLSTSWRPGDSALSYTCRAmPISWSSCPIPDGPFYADPNYASEKPSTAFCLLAKGLLIFLLLVILAM 
GLWVIRVQKRHKMPRMKKLMRNRMKLRKEAKPGSSPA 

132

ACTAGGAGCCTCCTAATGCAGTCTTCTGCACAGTCCTGGGGACTGACTGACTGAATCACACCTCTGGGGCTGGGGGCTGCTGAC 
ATGTGTGCCTTTCCTTGGCTGCTTCTTCTCCTGCTGCTCCAGGAGGGCAGCCAAAGGAGACTCTGGAGATGGTGTGGATCCGAG 
GAAGTGGTTGCGGTCCTTCAGGAGTCCATCAGCCTCCCCCTGGAAATACCACCAGATGAAGAGGTTGAGAACATCATCTGGTCC 
TCTCACAAAAGTCTTGCCACTGTGGTGCCAGGGAAAGAGGGACATCCAGCTACCATCATGGTGACCAATCCACACTACCAGGGC 

10 CAAGTGAGCTTCCTGGACCCCAGCTATTCCCTGCATATCAGCAATCTGAGCTGGGAGGATTCAGGGCTTTACCAAGCTCAAGTC 
AACCTGAGAACATCCCAGATCTCTACCATGCAGCAGTACAATCTATGTGTCTACCGATGGCTGTCAGAGCCCCAGATCACTGTG 
AACTTTGAGAGTTCTGGGGAAGGTGCCTGCAGTATGTCCCTGGTGTGCTCTGTGGAGAAGGCAGGCATGGATATGACCTACAGC 
TGGCTCTCCCGGGGGGATAGCACTTATACATTCCATGAAGGCCCTGTCCTCAGCACATCCTGGAGGCCGGGGGACAGTGCCCTC 
TCCTACACCTGCAGAGCCAACAACCCCATCAGCAACGTCAGTTCTTGCCCCATCCCTGATGGGCCCTTCTATGCAGATCCTAAC 

15 TATGCTTCTGAGAAGCCTTCAACAGCCTTCTGCCTCCTGGCCAAGGGATTGCTCATCTTCTTGCTCTTGGTAATTCTGGCCATG 
GGACTCTGGGTCATCCGAGTCCAGAAAAGACACAAAATGCCAAGGATGAAGAAACTCATGAGAAACAGAATGAAATTGAGGAAG 
GAGGCAAAGCCTGGCTCCAGCCCTGCCTGACTGCTCCTTGGGAACCCCAGTCCTGAGCTTGGTTTCTTCCCAGCACCCAGAGAA 
TCCTTCCTCAGCTCTCTTCTTTCCAGGGGAAGGAGGTGCTCAGGGGTGGGTATCCAGAGAGCCATACTTCTGAGGGAAGACTGG 
CTGGC AATAAAGTCAAATTAAGTGACCAC CAAAAAAAAAAAAAAAAA 

133

MIEEKSDIETLDIPEPPPNSGYECQLRLRLSTGKDLKLWRSTDTVFHMKRRLHMEGVEPGSQRWFFSGRPLTDKMKFEELKI 
PKDYWQVIVSQPVQNPTPVEN 

134 

ATATCAGCTTCCAGTGTATTGCTTGGCACCGCCAATCAACATGATAGAGGAAAAGAGCGACATAGAGACTCTGGATATTCCTGA 

25 GCCACCACCCAATTCTGGATATGAATGTCAGCTTCGTTTGCGC.CTTTCCACAGGCAAAGACCTCAAGCTTGTGGTTCGCAGCAC 
AGACACAGTATTCCACATGAAGAGACGGTTGCATGCAGCAGAGGGAGTGGAACCAGGTAGTCAGCGGTGGTTTTTTTCTGGCAG 
ACCTCTCACTGACAAAATGAAGTTCGAAGAGCTGAAGATCCCAAAGGACTATGTTGTACAGGTTATAGTGAGCCAACCTGTGCA 
GAACCCAACACCAGTGGAGAACTGAACTGAGCCCTGTTGGCCAGCTCCCACATCCCTCTGCTCCTTTTTATGGTTCTTGTTGTC 
ATTTCCTACTCTGCGGCGTGAAATCTATTTCACTGCTCTAAATTCCCTATGAATGGATTTAGTTCTGAGGAATTACCAGTGAAA 

30 AATTCCATCTGTGATGGAGACCAACAAAAATAATAAAACACAAAGAGCCAGGCTTTGAGACTCATGTMTTACAATTTCTAATT 
TGAAAGGCAGTTAAGAAATAGATAACCATTTATTTTAGMCACTCAACAACTATGTAATGGCTATATTTCAGTGACTTGGACTG 
TAAATGAAACATTGCATCCATGAAGGACAGCACCAAGCACCTTTTTGAATACAGAATTTTTTTAGAAAAATATATCAAATTATA 
TAATTTCCAGAAACCATAAATAATGGATATAAAACTTAACCTTTTTGTTTTGTTTTGTTTTGTTTTGTTTTGTTTTGTTTTGGT 
TAATGGAAACTGAAAAGAGCAGTATTTGAGGTTGCTTCTATTCTGGTTTTTTATTCTTAGCTCAATTAATATTTAGCCATAAAT 

35 GAGTAGAGACTGGCAAATTGTGCTTTAGTGTTGCTTTCCTCATCCCCACATCTTGAGCTCCTTATTTACATTCACACTAAATTT 
TGGTGCCTTCCAGCACATTAGTGGCAGGCACCCTTCTGGAACACTAGGCAATAATTTCATCAATACAGTCAGGTCTCTTGAGTT 
TCAACAGATACTCAGTTGAAAAGTCGCTGTCATCTTGCTGCATAAGTATTTTGAAAGGTCTGTATAACGAAGCCATTTTTATAT 
CCAGGCTTAGAAGGTCACTACTATATAGTACCTTCATTGATCTATCTATTCTGCTTGGAACTTTTCATAGCTAAGTATAACCCC 
CAAATGCATGGTTCCTGGGTCAGGAGACACCAAAATCAATTATAGCTGTTCCCACTCAAGTTAATAAAGTAAATGATTTCCTCC 

40 ACTTTGCATGGAGGGGTGTAAGGAAAGCCTATTTTATCTGTGCCTGGGAGAACTGTGCCTATTTTCAGTCTTTTAGAGGAAATT 
TCAACTCAAAATTTTTAAGTGTGAAAGGTTTACTGGTGTCACATAAACATTAGCAGTGAGACCAAATAATGAAACATTGCTTTA 
TACCTTAGTGCTTTCCAGTTCACTGTTACCTCTAGTCATGGTAGATGACAATTTTCCTCCCTCATCTTTTGTAGCAAAGAAGCA 
AATTAAGAGCCTTTTGCAAGTTAGATTAGGGGACTGAAAAATCCAGGGAGATAACCTGATTCTTCTATACTATGACTGGAGTAA 
AGCTGCCAAGAGTTAGGTTGAATTTGGAGCCTTTTAAAAGTTGGTTCCATAAGTGAGAAGGGGGGCTAACATGTGACCACTGTG 

45 CCCCTGAGAATCCATGAGATGGAAAAGTACAGACMGAAGGCACTTAACGCTGTTCTCAGTTGGAAAATGTCAGCCCTCCACAC 
CTGTTTGGATGTGMGTGCATTATTAATTTTAATGACAAAATGAAACTGCTAATTAGTTTAAATGTAATGTTTAAATGGTATTG 
MTTTCCTATTGCCTGATTAMTATATCTGTAATTGTAACATCATGTTTCTGAGAGCTTTAACTACTTCCTTATTTTTATGCAA 
TCATTTTAAAGATTGTGGCTATATCTAATTTAAATTTGCTGATGATTGAATCTAAATTCAAGTGAAGACCACAGCATTAATCCC 




AAAGGAGATTTTTTTTATGTTTAGAGGTGGAGATATTAGGGTAAATCAGTATATTGCTCCTGTCATCAAAGTTTTTGTTCATTT 
GTGTGCCACATCGAATAGCAGCACTTCTCAAAGTTTGTCAACTGGAAAAAAGTGTTTAAAACTGTTTTTTTCTTAAGCAGATTT 
GGTGATGTCATCAATATCTATGATATCGATTGGTAAAAGGTCTTTCTTTGTTGAGGATTTGAGCTCTCTTTTTAAGAGTTTATT 
TTATTTTAAAGGAACATATGCTTTGMTAACAATTCTCTGGGCATGTTTAACTAACCTAGGTGGGAAAACCATGGTGCTGTTTA 
5 ATTGTAAATAGATTGATATAAGATTGGACCAGTACTTGATGTGTATAATTTTAGTATGTAGGGTTTATGTGCTTTTTTTAAAAA 
AAAAAAAAAGGTTCAATTTT 

136 

GGAAGGTATTAGACAAACATGCCCATGAAACCCCAGCTTCCCTTTTCTATGTGCTAGGCATGGAAACTTATGAAATTTTAGCAC 
TCCAAAGTCATTTGGACTTCAAGGCATTTAAAATCATTTTTCTAAGGATTTAAACAGCTCCACTATAAGTCTTCACCTGACATG 
10 AATTGGTGAGAGACCAGGCTGATCCTGGCAAAGGTCTTGTGTCTTTCTGCCAGGCAAAATCCTGGGTTCTTCTAGCAGGACCTA 
AGCCAGTCTGGGGACGCTGATATTGAGGATGAGCTGGGGGACTCTGCTCTGTCCTCTGTGAACACACAGGAGGCCCATCCAGAG 
TGAGTGAGGTTGATTCTCTCTCCCTCTTTGC 

137

MGTSLSPNDPWPLNPLSIQQTTLLLLLSVLATVHVGQRLLRQRRRQLRSAPPGPFAWPLIGNAAAVGQAAHLSFARLARRYGDV 
15 FQIRLGSCPIWLNGERAIHQALVQQGSAFADRPAFASFRWSGGRSMAFGHYSE 

HVLSEARELVALLWGSADGAFLDPRPLTWAVANVMSAVCFGCRYSHDDPEFRELLSHNEEFGRTVGAGSLVDVMPWLQYFPN 
PVRTVFREFEQLNRNFSNFILDKFLRHCESLRPGAAPRDM^^ 

TALQWLLLLFTRYPDVQTRVQAELDQWGRDRLPCMGDQPNLPYVLAFLYEAMRFSSFVPVTIPHATTANTSVLGYHIPKDTW 
FWQWSWHDPVKWPNPENFDPARFLDKDGLINKDLTSRVMIFSVGKRRCIGEELSKMQLFLFISILAHQCDFRMPNEPAKM 
20 FSYGLTIKPKSFKVNVTLRESMELLDSAVQNLQAKETCQ 

138 

ACTCTGGAGTGGGAGTGGGAGCGAGCGCTTCTGCGACTCCAGTTGTGAGAGCCGCAAGGGCATGGGAATTGACGCCACTCACCG 
ACCCCCAGTCTCAATCTCAACGCTGTGAGGAAACCTCGACTTTGCCAGGTCCCCAAGGGCAGCGGGGCTCGGCGAGCGAGGCAC 
CCTTCTCCGTCCCCATCCCAATCCAAGCGCTCCTGGCACTGACGACGCCAAGAGACTCGAGTGGGAGTTAAAGCTTCCAGTGAG 

25 GGCAGCAGGTGTCCAGGCCGGGCCTGCGGGTTCCTGTTGACGTCTTGCCCTAGGCAAAGGTCCCAGTTCCTTCTCGGAGCCGGC 
TGTCCCGCGCCACTGGAAACCGCACCTCCCCGCAGCATGGGCACCAGCCTCAGCCCGAACGACCCTTGGCCGCTAAACCCGCTG 
TCCATCCAGCAGACCACGCTCCTGCTACTCCTGTCGGTGCTGGCCACTGTGCATGTGGGCCAGCGGCTGCTGAGGCAACGGAGG 
CGGCAGCTCCGGTCCGCGCCCCCGGGCCCGTTTGCGTGGCCACTGATCGGAAACGCGGCGGCGGTGGGCCAGGCGGCTCACCTC 
TCGTTCGCTCGCCTGGCGCGGCGCTACGGCGACGTTTTCCAGATCCGCCTGGGCAGCTGCCCCATAGTGGTGCTGAATGGCGAG 

30 CGCGCCATCCACCAGGCCCTGGTGCAGCAGGGCTCGGCCTTCGCCGACCGGCCGGCCTTCGCCTCCTTCCGTGTGGTGTCCGGC 
GGCGGCAGCATGGCTTTCGGCCACTACTCGGAGCACTGGAAGGTGCAGCGGCGCGCAGCCCACAGCATGATGCGCAACTTCTTC 
ACGCGGCAGCCGCGCAGCCGCCAAGTCCTCGAGGGCCACGTGCTGAGCGAGGCGCGCGAGCTGGTGGCGCTGCTGGTGCGCGGC 
AGCGCGGACGGCGCCTTCCTCGACCCGAGGCCGCTGACCGTCGTGGCCGTGGCCAACGTCATGAGTGCCGTGTGTTTCGGCTGC 
CGCTACAGCCACGACGACCCCGAGTTCCGTGAGCTGCTCAGCCACAACGAAGAGTTCGGGCGCACGGTGGGCGCGGGCAGCCTG 

35 GTGGACGTGATGCCCTGGCTGCAGTACTTCCCCAACCCGGTGCGCACCGTTTTCCGCGAATTCGAGCAGCTCAACCGCAACTTC 
AGCAACTTCATCCTGGACAAGTTCTTGAGGCACTGCGAAAGCCTTCGGCCCGGGGCCGCCCCCCGCGACATGATGGACGCCTTT 
ATCCTCTCTGCGGAAAAGAAGGCGGCCGGGGACTCGCACGGTGGTGGCGCGCGGCTGGATTTGGAGAACGTACCGGCCACTATC 
ACTGACATCTTCGGCGCCAGCCAGGACACCCTGTCCACCGCGCTGCAGTGGCTGCTCCTCCTCTTCACCAGGTATCCTGATGTG 
CAGACTCGAGTGCAGGCAGAATTGGATCAGGTCGTGGGGAGGGACCGTCTGCCTTGTATGGGTGACCAGCCCAACCTGCCCTAT 

40 GTCCTGGCCTTCCTTTATGAAGCCATGCGCTTCTCCAGCTTTGTGCCTGTCACTATTCCTCATGCCACCACTGCCAACACCTCT 
GTCTTGGGCTACCACATTCCCAAGGACACTGTGGTTTTTGTCAACCAGTGGTCTGTGAATCATGACCCAGTGAAGTGGCCTAAC 
CCGGAGAACTTTGATCCAGCTCGATTCTTGGACAAGGATGGCCTCATCAACAAGGACCTGACCAGCAGAGTGATGATTTTTTCA 
GTGGGCAAAAGGCGGTGCATTGGCGAAGAACTTTCTAAGATGCAGCTTTTTCTCTTCATCTCCATCCTGGCTCACCAGTGCCjAT 
TTCAGGGCCAACCCAAATGAGCCTGCGAAAATGAATTTCAGTTATGGTCTAACCATTAAACCCAAGTCATTTAAAGTCAATGTC 

45 ACTCTCAGAGAGTCCATGGAGCTCCTTGATAGTGCTGTCCAAAATTTACAAGCCAAGGAAACTTGCCAATAAGAAGCAAGAGGC 
AAGCTGAAATTTTAGAAATATTCACATCTTCGGAGATGAGGAGTAAAATTCAGTTTTTTTCCAGTTCCTCTTTTGTGCTGCTTC 
TCAATTAGCGTTTAAGGTGAGCATAAATCAACTGTCCATCAGGTGAGGTGTGCTCCATACCCAGCGGTTCTTCATGAGTAGTGG 
GCTATGCAGGAGCTTCTGGGAGATTTTTTTGAGTCAAAGACTTAAAGGGCCCAATGAATTATTATATACATACTGCATCTTGGT 
TATTTCTGAAGGTAGCATTCTTTGGAGTTAAAATGCACATATAGACACATACACCCAAACACTTACACCAAACTACTGAATGAA 
GAAGTATTTTGGTAACCAGGCCATTTTTGGTGGGAATCCAAGATTGGTCTCCCATATGCAGAAATAGACAAAAAGTATATTAAA
CAAAGTTTCAGAGTATATTGTTGAAGAGACAGAGACAAGTAATTTCAGTGTAAAGTGTGTGATTGAAGGTGATAAGGGAAAAGA 
TAAAGACCAGAAATTCCCTTTTCACCTTTTCAGGAAAATAACTTAGACTCTAGTATTTATGGGTGGATTTATCGTTTTGCCTTC 
TGGTATACTTCCTTACTTTTAAGGATAAATCATAAAGTCAGTTGCTCAAAAAGAAATCAATAGTTGAATTAGTGAGTATAGTGG 
5 GGTTCCATGAGTTATCATGAATTTTAAAGTATGCATTATTAAATTGTAAAACTCCAAGGTGATGTTGTACCTCTTTTGCTTGCC 
AMGTACAGMTTTGAATTATCAGCAMGAAAAAAAA 
AGTMGTCTCATAGGTTAAAAAAAAAAGTCACCAAATAG^^^ 

ATCTTGTTCAGGAAAAGGTTGAATAATATATGCCTTGTGTAATATTGAAAATTGAAAAGTACAACTAACGCAACCAAGTGTGCT 
AAAAATGAGCTTGATTAMTCAACCACCTATTTTTGACATGGAMTGAAGCAGGGTTTCTTTTCTTCACTCAAATTTTGGCGAA 
10 TCTCAAAATTAGATCCTAAGATGTGTTCTTATTTTTATAACATCTTTATTGAAATTCTATTTATAATACAGAATCTTGTTTTGA 
AMTAACCTMTTMTATATTAAMTTCCAMT^ 

GTTCCAGTTGAAGTTAGTGGAAATCTGAACATTGTCCTGTGGAAGGCAGAGAAATCTAAGCTGTGTCTGCCCAATGAATAATGG 
AAAATGCCATGAATTACCTGGATGTTCTTTTTACGAGGTGACAAGAGTTGGGGACAGAACTCCCATTACAACTGACCAAGTTTC 
TCTTCTAGATGATTTTTTGAAAGTTAACATTAATGCCTGCTTTTTGGAAAGTCAGAATCAGAAGATAGTCTTGGAAGCTGTTTG 

15 GAAAAGACAGTGGAGATGAGGTCAGTTGTGTTTTTTAAGATGGCAATTACTTTGGTAGCTGGGAAAGCATAAAGCTCAAATGAA 
ATGTATGCATTCACATTTAGAAMGTGAATTGMGTTTCMGTTTTAAAGTTCATTGCAATTAAACTTCCAAAGAAAGTTCTAC 
AGTGTCCTAAGTGCTAAGTGCTTATTACATTTTATTAAGCTTTTTGGAATCTTTGTACCAAAATTTTAAAAAAGGGAGTTTTTG 
ATAGTTGTGTGTATGTGTGTGTGGGGTGGGGGGATGGTMGAGAAAAGAGAGAAACACTGAAAAGMGGAAAGATGGTTAAACA 
TTTTCCCACTCATTCTGAATTAATTAATTTGGAGCACAAAATTCAAAGCATGGACATTTAGAAGAAAGATGTTTGGCGTAGCAG 

20 AGTTAAATCTCAMTAGGCTATTAAAAAAGTCTACAACATAGCAGATCTGTTTTGTGGTTTGGAATATTAAAAMCTTCATGTA 
ATTTTATTTTAAMTTTCATAGCTGTACTTCTTGMTATAAAAAATCATGCCAGTATTTTTAAAGGCATTAGAGTCAACTACAC
AAAGCAGGCTTGCCCAGTACATTTAAATTTl'TTGGCACTTGCCATTCCAAAATATTATGCCCCACCAAGGCTGAGACAGTGAAT 
TTGGGCTGCTGTAGCCTATTTTTTTAGATTGAGAAATGTGTAGCTGCAAAAATAATCATGAACCAATCTGGATGCCTCATTATG 
TCAACCAGGTCCAGATGTGCTATAATCTGTTTTTACGTATGTAGGCCCAGTCGTCATCAGATGCTTGCGGCAAAAGAAAGCTGT 

25 GTTTATATGGAAGAAAGTAAGGTGCTTGGAGTTTACCTGGCTTATTTAATATGCTTATAACCTAGTTAMGAAAGGAAMGAAA 
ACAAAAAACGAATGAAAATAACTGAATTTGGAGGCTGGAGTAATCAGATTAeTGCTTTAATCAGAMCCCTCATTGTGTTTCTA 
CCGGAGAGAGAATGTATTTGCTGACAACCATTAAAGTCAGAAGTTTTACTCCAGGTTATTGCAATAAAGTATAATGTTTATTAA 
ATGCTTCATTTGTATGTCAAAGCTTTGACTCTATAAGCAAATTGCTTTTTTCCAAAACAAAAAGATGTCTCAGGTTTGTTTTGT 
GAATTTTCTAAAAGCTTTCATGTCCCAGAACTTAGCCTTTACCTGTGAAGTGTTACTACAGCCTTAATATTTTCCTAGTAGATC 

30 TATATTAGATCAAATAGTTGCATAGCAGTATATGTTAATTTGTGTGTTTTTAGCTGTGACACAACTGTGTGATTAAAAGGTATA 
CTTTAGTAGACATTTATAACTCAAGGATACCTTCTTATTTAATCTTTTCTTATTTTTGTACTTTATCATGAATGCTTTTAGTGT 
GTGCATAATAGCTACAGTGCATAGTTGTAGACAAAGTACATTCTGGGGAAACAACATTTATATGTAGCCTTTACTGTTTGATAT 
ACCAAATTAAAAAAAAATTGTATCTCATTACTTATACTGGGACACCATTACCAAAATAATAAAAATCACTTTCATAATCTTGAA 
AAAA 

139

MDISTRSKDPGSAERTAQKRKFPSPPHSSNGHSPQDTSTSPIKKKKKPGLLNSNNKEQSELRHGPFYYMKQPLTTDPVDWPQD 

GRNDFYCWVCHREGQVLCCELCPRWHAKCLRLTSEPEGDWFCPECEKITVAECIETQSKAMTMLTIEQLSYLLKFAIQKM 

GTDAFQKPVPLEQHPDYAEYIFHPMDLCTLEKNAKKK 

EVCPECYLMCQKRDNWFCEPCSNPHPLWAKLKGFPFWPAKALRDKDGQVDARFFGQHDRAW 
40 SIFNSAMQEMEVYVENIRRKFGVFNYSPFRTPYTPNSQYQMLLDPTNPSAGTAKIDKQEKVKLNFDMTASPKILMSKPVLSGGT 
GRRISLSDMPRSPMSTNSSVHTGSDVEQDAEKKATSSHFSASEESMDFLDKSTASPASTKTGQAGSLSGSPKPFSPQLSAPITT 
KTDKTSTTGSILNLNLDRSKAEMDLKELSESVQQQSTPVPLISPKRQIRSRFQLNLDKTIESCKAQLGINEISEDVYTAVEHSD 
SEDSEKSDSSDSEYISDDEQKSKNEPEDTEDKEGCQMDKEPSAVKKKPKPTNPVEIKEELKSTSPASEKADPGAVKDKASPEPE 
KDFSEKAKPSPHPIKDKLKGKDETDSPTVHLGLDSDSESELVIDLGEDHSGREGRKNKKEPKEPSPKQDWGKTPPSTTVGSHS 
45 PPETPVLTRSSAQTSAAGATATTSTSSTVTVTAPAPAA^ 

QQQQQQQNQQQQPQSSQGTRYQTRQAVKAVQQKEITQSPSTSTITLVTSTQSSPLVTSSGSMSTLVSSWADLPIATASADVAA 
DIAKYTSKVNGCNKRNNDRNIQRSF 

140 

GGAGTGTAGAGGAAATAAAATCATGTCTGTGGCTCATGACAGCATTCCATATTCAGTTGTTGGGTTGTGTTGTTGTTGTTGTGT 
50 GTGTTTTTMTAATACCATGTMCTGTTTAAAAAAATTTCCTTTTTGATCAAGTCCCAAATCTGCCAGCCTGTTCTCTCTTCCT 
AACTGTGGAAAAATGAGACGGTTGCTTGMCATTTGTTGGCTTGGCTGMGAGGAMTAAAAACAGAACAGGAGGTGGTAGAGG
GCATGGATATCTCTACTCGCTCCAAAGATCCTGGCTCTGCAGAGAGAACAGCCCAGAAAAGAAAGTTCCCCAGCCCTCCACATT 
CTTCCAATGGCCACTCGCCGCAGGACACATCAACAAGCCCCATTAAAAAGAAAAAGAAACCTGGCTTACTGAACAGTAACAATA 
AGGAGCAGTCAGAACTAAGACATGGTCCGTTTTACTATATGAAGCAGCCACTCACCACAGACCCTGTTGATGTTGTACCGCAGG 
5 ATGGACGGAATGATTTCTACTGCTGGGTTTGTCACCGGGAAGGCCAAGTCCTTTGCTGTGAGCTCTGTCCCCGGGTTTATCACG 
CTAAGTGTCTGAGACTGACATCGGAACCAGAGGGGGACTGGTTTTGTCCTGAATGTGAGAAAATTACAGTAGCAGAATGCATCG 
AGACCCAGAGTAAAGCCATGACAATGCTCACCATTGAACAGTTATCCTACCTGCTCAAGTTTGCCATTCAGAAAATGAAACAGC 
CAGGGACAGATGCATTCCAGAAGCCCGTTCCATTGGAACAGCACCCTGACTATGCGGAATACATCTTCCATCCAATGGACCTTT 
GTACATTGGAAAAGAATGCGAAAAAGAAAATGTATGGCTGCACAGAAGCCTTCCTGGCTGATGCAAAGTGGATTTTGCACAACT 

10 GCATCATTTATAATGGGGGAAATCACAAATTGACGCAAATAGCGAAAGTAGTCATCAAAATCTGTGAACATGAGATGAATGAAA 
TCGAAGTATGTCCAGAATGTTATCTAGCTGCTTGCCAAAAACGAGATAACTGGTTTTGTGAGCCTTGTAGCAATCCACATCCTT 
TGGTCTGGGCCAAACTGAAGGGGTTTCCATTCTGGCCTGCAAAAGCTCTAAGGGATAAAGACGGGCAGGTCGATGCCCGATTCT 
TTGGACAACATGACAGGGCCTGGGTTCCAATAAATAATTGCTACCTCATGTCTAAAGAAATTCCTTTTTCTGTGAAAAAGACTA 
AGAGCATCTTCAACAGTGCCATGCAAGAGATGGAGGTTTACGTGGAGAACATCCGCAGGAAGTTTGGGGTTTTTAATTACTCTC 

15 CATTTAGGACACCCTACACACCCAACAGCCAGTATCAAATGCTGCTCGATCCCACCAACCCCAGCGCCGGCACTGCCAAGATAG 
ACAAGCAGGAGAAGGTCAAGCTCAACTTTGACATGACGGCATCCCCCAAGATCCTGATGAGCAAGCCTGTGCTGAGTGGGGGCA 
CAGGCCGCCGGATTTCCTTGTCGGATATGCCGCGCTCCCCCATGAGCACAAACTCTTCTGTGCACACGGGCTCCGACGTGGAGC 
AGGATGCTGAGAAGAAGGCCACGTCGAGCCACTTCAGTGCGAGCGAGGAGTCCATGGACTTCCTGGATAAGAGCACAGCTTCAC 
CAGCCTCCACCAAGACGGGACAAGCAGGGAGTTTATCCGGCAGCCCAAAGCCCTTCTCTCCTCAACTGTCAGCTCCTATCACGA 

20 CGAAAACGGACAAAACCTCCACCACCGGCAGCATCCTGAATCTTAACCTGGATCGAAGCAAAGCTGAGATGGATTTGAAGGAGC 
TGAGCGAGTCGGTCCAGCAACAGTCCACCCCTGTTCCTCTCATCTCTCCCAAGCGCCAGATTCGTAGCAGGTTCCAGCTGAATC 
TTGACAAGACCATAGAGAGTTGCAAAGCACAATTAGGCATAAATGAAATCTCGGAAGATGTCTATACGGCCGTAGAGCACAGCG 
ATTCGGAGGATTCTGAGAAGTCAGATAGTAGCGATAGTGAGTATATCAGTGATGATGAGGAGAAGTCTAAGAACGAGCCAGAAG 
ACACAGAGGACAAAGAAGGTTGTCAGATGGACAAAGAGCCATCTGCTGTTAAAAAAAAGCCCAAGCCTACAAACCCAGTGGAGA 

25 TTAAAGAGGAGCTGAAAAGCACGTCACCAGCCAGCGAGAAGGCAGACCCTGGAGCAGTCAAGGACAAGGCCAGCCCTGAGCCTG 
AGAAGGACTTTTCCGAAAAGGCAAAACCTTCACCTCACCCCATAAAGGATAAACTGAAGGGAAAAGATGAGACGGATTCCCCAA 
CAGTCCATTTGGGCCTGGACTCTGATTCAGAGAGCGAACTTGTCATAGATTTAGGAGAAGACCATTCTGGGCGGGAGGGTCGAA 
AAAATAAGAAGGAACCCAAAGAACCATCTCCCAAACAGGATGTTGTAGGTAAAACTCCACCATCCACGACGGTGGGCAGCCATT 
CTCCCCCGGAAACACCGGTGCTCACCCGCTCTTCCGCCCAAACTTCCGCGGCTGGCGCCACAGCCAGCACCAGCACGTCCTCCA 

30 CGGTCACCGTCACGGCCCCGGCCCCCGCCGCCACAGGAAGCCCAGTGAAAAAGCAGAGGCCGCTTTTACCGAAGGAGACTGCCC 
CGGCCGTGCAGCGGGTCGTGTGGAACTCATCAAGTAAGTTTCAAACGTCCTCCCAAAAGTGGCACATGCAGAAGATGCAGCGTC 
AGCAGCAGCAGCAGCAGCAGCAAAACCAGCAGCAGCAGCCTCAGTCTTCCCAGGGGACGAGATATCAGACCAGACAGGCTGTGA 
AAGCTGTCCAGCAGAAGGAGATCACACAGAGCCCATCCACGTCCACCATCACCCTGGTGACCAGCACACAGTCATCGCCCCTGG 
TCACCAGCTCGGGGTCCATGAGCACCCTTGTGTCCTCAGTCAACGCTGACCTGCCCATCGCCACTGCCTCAGCTGATGTCGCCG 

35 CTGATATTGCCAAGTACACTAGCAAAATGATGGATGCAATAAAAGGAACAATGACAGAAATATACAACGATCTTTCTAAAAACA 
CTACTGGAAGCACAATAGCTGAGATTCGCAGGCTGAGGATCGAGATAGAGAAGCTCCAGTGGCTGCACCAGCAAGAGCTCTCCG 
AAATGAAACACAACTTAGAGCTGACCATGGCGGAGATGCGGCAGAGCCTGGAGCAGGAGCGGGACCGGCTCATCGCCGAGGTGA 
AGAAGCAGCTGGAGTTGGAGAAGCAGCAGGCGGTGGATGAGACCAAGAAGAAGCAGTGGTGCGCCAACTGCAAGAAGGAGGCCA 
TCTTTTACTGCTGTTGGAACACCAGCTACTGTGACTACCCCTGCCAGCAAGCCCACTGGCCTGAGCACATGAAGTCCTGCACCC 

40 AGTCAGCTACTGCTCCTCAGCAGGAAGCGGATGCTGAGGTGAACACAGAAACACTAAATAAGTCCTCCCAGGGGAGCTCCTCGA 
GCACACAATCAGCACCTTCAGAAACGGCCAGCGCCTCCAAAGAGAAGGAGACGTCAGCTGAGAAAAGCAAGGAGAGTGGCTCGA 
CCCTTGACCTTTCTGGCTCCAGAGAGACGCCCTCCTCCATTCTCTTAGGCTCCAACCAAGGCTCTGACCATTCCCGGAGTAATA 
AATCCAGTTGGAGCAGCAGTGATGAGAAGAGGGGATCGACACGTTCCGATCACAACACCAGTACCAGCACGAAGAGCCTCCTCC 
CGAAAGAGTCTCGGCTGGACACCTTCTGGGACTAGCAGTGAATCGGGACACAAACCACCCACCCCATTGGGAGAAAAACCCAGA 

45 CGCCAGGAAAAGAAGAAACAAGAAAGGCAGGAGAACAGCCACTTTCAGACTTGAAAATGAGAAAACCCTCAGTTGAGCCTGAGC 
CCCCGGCGCGGGGGCTGCTACACTACAGGACACCCAGCATCGGCTTTGACTGCAGACTGTTCACCCACACGAGCCCTGTGCTTT 
TGGTGTAAATAATGTACAATTTGTGGATGTCATTGAATCTAGAGGACTTTCCCCTTTTTATATTTGTATTAACTTTAACTTATT 
AAAAAAAAAAAAAGAAAAAGAAAAACGATTT 

141 

50 MALSSQIWMCLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPICIFCCGCCHRSKCGMCCKT 
142

TCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCTGGGCCGCTTGCCTCCTGCTCCTCC 
TCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAGACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTG 
GAGCCAGGGCCAGCTGGATGCCCATGTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCT 
5 GCTGTCATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCCTTCCTTATTTATT 
CCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTTTCC 

144 

AAAGTCCTTGTAACATTGAGTTACAGGGCTTTAACTCCTGTGTCTGAAAAATCACAAACACTGATGACAATCAAAGCCTCATCT 
TAAGGCCCCGTAGMGATGCCAATCAAAATAAACTGCATTCCTGAGGCACTAGGCMGAAATTAAAGCTATTCAACTCCTCAAG 
10 GCCCAGGGACTATTGCGGAAGAGGTGGGCGCGTAAGATTGTAAGGGCCGATTTTGAAAGATCCAGTAAGTTCAGTTTCTCTATG 
AACTAATCATTCAAGTCAAAGGCACACTGATGCAAAATCAGTATATGGGACCCCTGTGTCTGGATTAGGCAAGGTTTTTCTTGG 
AAGCCTTTAACCAACTCCTTCCNTAAAGGGTTATTAAAAGGGCTTATGGGGAGTTATTATTTT 

145 

MGLPRGSFFWVLLLLTAACSGLLFALYFSAVQRYPGPAAGARDTTSFEAFFQSKASNSWTGKGQACRHLLHLAIQRHPHFRGLF 
15 NLSIPVLLWGDLFTPALWDRLSQHKAPYGWRGLSHQVIASTLSLLNGSESAKLFAPPRDTPPKCIRCAWGNGGILNGSRQGPN 
IDAHDYVFRLNGAVIKGFERDVGTKTC^ 
LPKGDRPHAYFGPEASASKFKLLHPDFISYLTERFLKSK^ 
YFERKMKPLIFYANHDLSLEAALWRDLHKAGILQLYQR 

146 

20 GGGACGTCAGCGGACGGGGCGCTCGCGGGCCGGGGCTGTATGGGGCTCCCGCGCGGGTCGTTCTTCTGGGTGCTGCTCCTGCTC 
ACGGCTGCCTGCTCGGGGCTCCTCTTTGCCCTGTACTTCTCGGCGGTGCAGCGGTACCCGGGGCCAGCGGCCGGAGCCAGGGAC 
ACCACATCATTTGAAGCATTCTTTCAATCCAAGGCATCGAATTCTTGGACAGGAAAGGGCCAGGCCTGCCGACACCTGCTTCAC 
CTGGCCATTCAGCGGCACCCCCACTTCCGTGGCCTGTTCAATCTCTCCATTCCAGTGCTGCTGTGGGGGGACCTCTTCACCCCA 
GCGCTCTGGGACGGCCTGAGCCAACACAAAGCCCCGTATGGCTGGCGGGGGCTCTCTCACCAAGTCATCGCCTCCACCCTGAGC 

25 CTTCTGAACGGCTCAGAGAGTGCCAAGCTGTTTGCCCCGCCCAGGGACACCCCTCCAAAGTGTATCCGGTGTGCCGTGGTGGGC 
AACGGAGGCATTCTGAATGGGTCCCGCCAGGGTCCCAACATCGATGCCCATGACTATGTATTCAGACTCAATGGAGCTGTGATC 
AAAGGCTTCGAGCGCGATGTGGGCACCAAGACTTCCTTCTATGGTTTCACTGTGAACACGATGAAGAACTCCCTCGTCTCCTAC
TGGAATCTGGGCTTCACCTCCGTGCCACAAGGACAGGACCTGCAGTATATCTTCATCCCCTCAGACATCCGCGACTATGTGATG 
CTGAGATCGGCCATTCTGGGCGTGCCTGTCCCTGAGGGCCTAGATAAAGGGGACAGGCCGCACGCCTATTTTGGACCAGAAGCC 

TCTGCGAGTAAATTCMGCTGCTACATCCGGACTTCATCAGCTACCTGACAGAAAGGTTCTTGAAATCAAAGTTGATTAACACA
CATTTTGGAGACCTATATATGCCTAGTACCGGGGCTGTCATGCTGCTGACAGCTTTGCATACCTGTGACCAGGTCAGTGCCTAT 
GGATTCATCACAAGCAACTACTGGAAATTTTCCGACCACTATTTCGAACGAAAAATGAAGCCATTGATATTTTATGCAAACCAC 
GATCTGTCCCTGGAAGCTGCCCTGTGGAGGGACCTGCACAAGGCCGGCATCCTTCAGCTGTACCAGCGCTGACCCCAATGCACT 
GAGCGCTTTGCTTCTTCAAGAGTTGCGGCCCTGATCCTCTCAAGTGGCCAAAAGCTTTTTTAACTTTTCAATCTTCACCTTCCC 

35 TTGCCAACAGAGGGCACTGGGGTGAATTCAAGATTTTCATCGAGGTCTGTTCAATATAGGACACCCCAGCTTGTCCTTGGCTCA 
TCCAAGAACTCTTCTGTATCTAAAACAATACATCTCAATCTTGGCCAAGGGAAAATGGACTGCTTTGCTGGATTGGCACTGAGC 
AACTTTAGGAAATGTCGGTGGAGTGTTCAGCAAGATCAGACAGCAGTCCAGGTCAAAGGCAAACACACACGCTCCAGCCCAAAT 
CCTCCTGGTGGCACATCCTACCCCAGATGCTAAAGTGATTCAAGGACTCCAGGACACCTCTTAAGAGCCTTTCTAAGAACATGA 
TAGGCTTACTTCTGCTCCATAATAAAGTGGGAGAAAAAAGCCAGAATATAACTTAAGACTAGATAACTGCGTACATGATGGACC 

ATTTTTTTTTTTTTTGGCTGGGTAGAGAAATCATATAAAACGCAGGCTGTTTAGCATGGAGATGACTCTCAGAACACTGGGAGG
GTCTGGCACTTGATGGGGGTTAGTTGCTTGGCAGCCTGCCTGCCACTGAGGGAAGTCCCATTAGAGATGTATCACCACCTTGTC
ACCAACAGGATGATGTCACCAACAGGATGATGTCACCAGGTAATAAACCTTCATCCTCAC 

148

GGGGATAAGTTTTTTTTTTTTCTTTTTTTTCTTTTCTCTAGTTTTTCTTTCTGGCTGAGATTTCCGTGCAAGACAGCACCCAAT 
AGACTATTTAGAGTTGACATTTGACATTTTAATGGGCGCCATGGCTCATTTTGTAGATTGAGAAGGTGCGTCTCCCCTGCTCCA
AGTCTCATCATGACAGCGTGCTGACAGCTGGGAGTCTGTGGCCTTCCTCACGCAGAGGCCTTAAAGCTGGACACAGATGCACGC 
CTAGGCTGGGCAGGGATGGGACCCATGCCCCCTCCTTAGAGGACGGGCTTCCTGGTTAGGAAAGGACACGTGGGGGTGCCTTGC 
ATAATAGTTCACTGGTCACCGTGCTTTTATGAGTAGTGTTTTTGTGCACTTGCCAGGGGTTTTCTCTCTGTGTGCGAGGGGAGT 
GATTTAAGCAATGGTGTCTGGAGTAAGCCTTACAATTTTAATAGACTTTTTCTTATCATATCCCTCATTTCTTTCCCTGAAATA
AAAATACACACAAGCAAAAAAAAAATGATAGTTTCAC^

CCAGGATTTTCCTACATAGTCAGAACTTACACATTACTAGAGGCACACCCACCAAGGAGTATTGTGTCTACTTTTATCTGTGCA 
CCAGCCACAAATACCCACATTGGAAAGACCCATTTGTGATGGGTAAACATCCCTTCCTGTCTCCCACAACCCCTGTGACTGCCC 
TGCATGTGTTCATGACCTCCGAAGGCCCTAATTCATGAAGCAGCAAACCCAGCAGATCTCCACCCCCCTGCCTCAGGACCTCTG
CTGAAGAGGGGGATGAAGTGGGTCTCCAGGGAGGCAGTGGGGGCCTTGTTGGCAGCTGGCTCGGGAGCCGGCTTACAGGAGGGC 
AGCTCTGCAGTTGGGAGGGGCACCGTCCGGAGGAGACCAGGCCTCTACACACCCCCCACTCTACTTATCATCCCTGCTCACACA 
CCCTTGTCCAAGGCTTTATGCATCGGATTTATTTTTCCAAATCGAGAGGACAGTGATAGATGCATTTTCCCCAGGCTGTCTCAG 
AAAGGTCGCTAAATGTATACTGTTGTCAGAATTGCTGAGATCTCCCCCCACTTTTGGTTTTTGCAGCAGTAAAAACTCTTTCCA 

CTGTGACTTATTTTCTCTCTCAGGCAGCCAGCCACCTGGTCCCTTGTGCTGACTCTAGCACAGTGGCCAGGATCCAATACGAGT
CCAGGGGTGACCGCAGGATGGTGGGGGCAGCGGGCTTCTCCACCTACCCCAGCCACCAAGGCCCTGACGCACTGCCTCCTGCAC 
CTTCAGCACATCCCTGTGCACAGCTGGAAGGGTGCATGGCCCGCTCACCTTTGTTCAGATGGGTGGAAACGCTGATGATACCAG 
CTCCTCCCTGCCGTGCCCCTGCCACGGAGCAGGCATTGTGAACTGGCTGGTGTTTGCAGTCCCACGTGGCATGGCCTCCAGCCC 
AACCCACAGTGGAGACTGGAGACAGGGCAATGAGTCTGGTCGGGGGCACGTGGACATGCCCCATAGGGGCCCCACCCAGACTTA 

ACAGGCAAGGTCCTGGGCATTGCGCGACGCAGGACTCAATGCTAAAGCAAGCCTGCCTGGCTCTGTGCCAGGGCCCCTCTTCTG
ATTTACACATCCCATTTTTACACAGACCCTTCCTTCTTAATAAAGGCTGACAGTTCTGTTGGCAGCCAAGAACCCAGACCATGA 
AGACAGGGAGTGAGGGGCCTTTGTGCCCAACTCCAGCACAGCTGCGTTCTGGGGTGTGTGAGAGGCATGTTCGTGTCTGTGCGC 
TGGTGGTCTCGTGAGACAGTTCCGAGGACGGGGAAATTGCAGGGTGGTGGGGGCGTGAGGCTTATATGTGGAACTGATGCAGAG 
TTCGCCTGCAGACGGATCTGGATATACACTATGTATAATTGTTACGTGTAATTTAAAATATATCTGTTTGCCATCGTCATGAGA 

AGATTATATGTAAGGCTCTGAAGGGAGAGGGAGATGTACATTCTGCCAGGCTCCTGGGGACCTTATGCGAGTCATGAAATTGAT
GACTGTTGATCCAGTGGTGCAAGAAGCTACACTCCATGTGTCATCACGCTTATGACTCCTAATGTATTTTTAAGGCAAAAAATG 
TCAGCCGACTCCATCTTCACCCCTCGATTCCTCGAGTCCAGCCTTTCTGTGCCAGTGCTTCACTGAGCCACAACGCTCTCGCCA 
TCGGGACCCGGCTGGGCCTGGAGTCTCGGGGCACAGTTGCCATGGAGCCCTCCTGGGTCATTCTACAAATGTGCTGAGTGCCAG
CTGAAAACCCCACAGGAGATGGAGTACCTTGGCCAAGCTTAAAGAGAAGATTTTCTCAGGGTATTTATTAGTGTGTCCAGCAGG 

GTCAGGAAGCAGGATGGAAAGATGCATTCAGACTGTTAATTTATTAACAAGGCAAATGATTTTGTGTTTCTTGATGACAGACTA
TTAAGTTTGGGACTTATTTTCCCATTTGAGAAGTTATAATATATATTTAAGATGATAAGTTTCCTGCTTAAGTTGTGCCTTTCA 
GCTTCAATGAGTTTAAGGAGCACTAAGGCTAATGATACCAATGAGGGTTGGTTTATTATCAAACCTGAATAGCTGTGGTTTCTC 
CAGTAAATATTTTCTTCTACTGAACATGGAGCCATTATTAAGAGTTGTGTGTTTTTTATTATGTACATTTGTATATTTTTTTGC 
TTGTTTGATGTTCTATTTTTCTAATAGTTTTCTTTTAGTTTCTTAAAGTTGTGATACTAGATTTAGATTCTGATGCTAACTGCA 

AATCAGGTTGGTCTCTGCTGGGTCTCTCCTGCTTTTATTTTACTTTAAGGACAAGTGTAGTTGTCGTCCACCACCTTTCAAAAA
ATGTGAAACTGCCCTGCCTCCCCTTTTTGCTGACAACACTGTGTACATTGACCACTTCCTACCATACTTTATGTTGTAAAATCA 
AACTCTTTTGTGGTACATTATCTCATGCTTCTGCAAATTCGAATAAATTCTATGGCTTCCAAAAAAAAAAAAAAAAAAM 

150 

AAGATCTATGTCCTGGGGGGCCGCCAGGGCAAGCTCCCGGTGACTGCTTTTGAAGCCTTTGATCTGGAGGCCCGTACATGGACC 

CGGCATCCAAGCCTACCCAGCCGTCGGGCCTTTGCTGGCTGCGCCATGGCTGAAGGCAGCGTCTTTAGCCTGGGTGGCCTGCAG
CAGCCTGGGCCCCACAACTTCTACTCTCGCCCACACTTTGTCAACACTGTGGAGATGTTTGACCTGGAGCATGGTGAGCAGTGG 
CTGTTCTGGGCTGTCCTCCCGCTCTCTGTGGGATGGAGGGGCATAGTGTGTACATGACTAGATCTGACCTCCCCTCTCCTGCAG 
GGTCCTGGACCAAATTGCCCCGCAGCCTGCGCATGAGGGATAAGAGGGCAGACTTTGTGGTTGGGTCCCTTGGGGGCCACATTG 
TGGCCATTGGGGGCCTTGGTAAGTCTCTATGGGGCTGGGGAGAGGAGGGAGTCCCAAGACAGGAAAGACTAGCCCCCAGCATGT 

GTGTCACCTTCTGCCCATCTCCAGGCACTCCAGGGGTCAGGGCTTTGTGAGCTCTTTTCCCTATCTAT<jAAGTAGGCAGGATGT
GGTCTGCCTGGCCCAGCACTCAGGGTGTTCCTCAGTGACTGGGGGCTCTGTCAAGCACCAGCAAGGGTCTACGAGACAGCAGCA 
GGACAAATGACCACTGTATGCACAGTCAAGAGAACATATAGAAACATAAATGAATGAACAAATGATGAACATAGGCAATGAGGA
GCAAGGCCCTGATAGCTCATCCTGGGGTGATAACTACTGGGCTATTCTCTGGGCTTAGGGAGCAGAAGGGCCTGGAGTCCCTAC 
GGCCTGTGCCTCTCACAGCCTTCTCTCTCCTTCTTGCAGGAAACCAGCCATGTCCTTTGGGCTCTGTGGAGAGCTTTAGCCTTG

CACGGCGGCGCTGGGAGGCATTGCCTGCCATGCCCACTGCCCGCTGCTCCTGCTCTAGTCTGCAGGCTGGGCCCCGGCTGTTTG
TTATTGGGGGTGTGGCCCAGGGCCCCAGTCAAGCCGTGGAGGCACTGTGTCTGCGTGATGGGGTCTGAAGGCTTGGTGGGAGCT 
GTCCACTGGAGCAGCTCATTGCCAGAGGCAGCTATTTCTATGGCTCCTTTTGCTGCTGAGGACACTCACTGTGGCTCTGTGGGA 
TGAGAGAGGCATGGGGGTGAGCACTTGAAACACTGCCTTGGGGCCTTGGGTTAGGGGAGCCTTTGTCTTTAGTGCAGGACACAC 
ATATGCTTACACCTACCTTTATCACCATTCGTTCATGAATCATGCCTAGCTCCATCCTTGCCCTGGGACCTACTAGGCCTTCCA 

TCCAACTGGGAAATGGGGAGAAGCAAAGCTGGCCCCATGCTCTTCAGGGTCAGTTCCTATCTGGAGTTGACCAGGCCTACCCCA
GTTGCCATTCCTGAAAAATCTCAGCTGCCAGGCTGCCTTTAGGGTCCCTGTAGACCCAGGAGAGTTGAGAGGGTGGGGGACACA 
GAGAGAATAGAGAGGATGTGGGAACTGCCAGAGGGCCGGAGCGCAGGAGTTCAAGTGGAGGAATGCTGGCTTTGAGCCCTCTAC
ACTGCTGGTTGTATGACCTTGGACAAGTCACTTCACCTCTCTGTGCCTCAGCATCCTCATCTATAAATGGGGATCTCTGAAACC 
TTCCTACCCTACCTACCTCACAGGGCTGTTGTGAGGACCCAGGGAGTTTGGATGTGGAAGTAAAAGTGCTGCTAAAACCT 

151 

MVRSRLTAVSASWQAHPPADMGRRKSKRKPPPKKKMTGT^
LSEPVDVYSDWIDACEAANQ 

152 

GGCACGAGGGGCTCATCCAGCTGCAGACATGGGGCGCAGAAAGTCAAAACGAAAGCCGCCTCCCAAGAAGAAGATGACAGGCAC 
CCTCGAGACCCAGTTCACCTGCCCCTTCTGCAACCACGAGAAATCCTGTGATGTGAAAATGGACCGTGCCCGCAACACCGGAGT 

CATCTCTTGTACCGTGTGCCTAGAGGAATTCCAGACGCCCATAACGTATCTGTCAGAACCCGTGGATGTGTACAGTGATTGGAT
AGACGCCTGCGAGGCGGCCAATCAGTAGCGACACAGAGGACCCGCCCCCTGAGCAGCCCCGCGTACTGTGGATCCAGCTGTTCG 
GTTCTGGTCCAGAGACATTCCAGGGGTCCAGGGTGTGGGTCCTGGGCTGTCACAGCCGTGTGTGTGTGTGTGTGTGTGTGTGTG 
TGTGTGTGTGTGTGTGTAGTGGGTGTGCGTGTGGGTGTGGGTGTGAGTGAGTGTGGGTGTGTGTGGCTGCACGTGTCACTGGGG 
TGGCCGTGAGTGTGTGCTCACAGGTACGCGGTGGTGTCGGGTTCCTGGGCCTGAGGGGCCTGAACTGATCTCACTTGGCTCCGA 

AAGCCTTTGCTGTGTTCCCTGCAGCCCCTGGCCCCCCAGCCTTGGGGCTCTGGCTCCCCCCGGCGGAATTGGGGGACTGTTTCC
TGACATCCTGGACAAGGGAAGCCCACTAGAGGCTGGAACAGGACCTCTCCAGCCTCCTCACCAGCACCGTGCCCATCTCAACTG 
GACTTCCCGCCCTCCTTCTCCACCTTCTAGTGCCCGTGGCCGGGGATTCAAAGCCGCCGTTCCCCAGGTCCCTGGGCTGGGCCC 
TGACAGGGAGCCGCCCCCCTCCCCATGGTAACCAGGAAGCCCGTTTCATGTTCAGTTGCTTTTGTAGAGGAAGCAAGGGCTGGG 
ATGGGGACAGCTGTCAATCACAAGCCCTTAAATAAAGCAGCCAGCGCACAAAAAAAAAAAAAAAAAA 

153

MPPPSDIVKVAIEWPGANAQLLEIDQKRPLASIIKEVCDGWSLPNPEYYTLRYADGPQLYITEQTRSDIKNGTILQLAISPSRA 
ARQLMERTQSSNMETRLDAMKELAKLSADVTFATEFINMDGIIVLTRLVESGTKLLSHYSEMLAFTLTAFLELMDHGIVSWDMV
SITFIKQIAGYVSQPMVDVSILQRSLAILESMV^^ 

DMMAFAQKHLRSIILNHVIRGNRPIKTEl^^
GTEKRKAMYTKDYKMLGFTNHINPAM^^

LQVGELPNEGRNDYHPMFFTHDRAFEELFGICIQLLNKTWKEMRATAEDFNKVMQWREQITRALPSKPNSLDQFKSKLRSLSY 
SEILRLRQSERMSQDDFQSPPIVELREKIQPEILELIKQQRLNRLCEGSSFRKIGNRRRQERFWYCRLALNHKVLHYGDLDDNP 
QGEVTFESLQEKIPVADIKAIVTGKDCPHMKEKSALKQNKEVLELAFSILYDPDETLNFIAPNKYEYCIWIDGLSALLGKDMSS 
ELTKSDLDTLLSMEMKLRLLDLENIQIPEAPPPIPKEPSSYDFVYHYG 

154

GGAGGGAGGTGTAGAAAGAGGTACATGGAGAACAAGTTTGTCAATCCGTCTGAACTTCAGTTGCCTTACCTGTAAGGCAGCCGT 
GTCTGTGTTTTTGTCTCGCAGAATTAGAGCCCATTGGGAACGATGCCACCACCGTCAGACATTGTCAAAGTGGCCATTGAGTGG 
CCAGGTGCTAACGCCCAGCTCCTTGAAATCGACCAGAAACGGCCCCTGGCATCCATTATCAAGGAAGTTTGTGATGGGTGGTCG 
TTGCCAAACCCAGAGTATTATACCCTCCGTTATGCAGATGGTCCTCAGCTGTACATCACCGAACAGACTCGCAGTGACATTAAG 

AATGGGACAATCTTACAACTGGCTATCTCCCCGTCCCGGGCTGCACGCCAGCTGATGGAGAGGACCCAGTCATCCAACATGGAG
ACCCGGCTGGATGCCATGAAGGAGCTGGCCAAGCTCTCTGCCGACGTGACTTTGGCTACTGAGTTCATCAACATGGATGGCATC 
ATTGTGCTGACAAGGCTCGTGGAAAGTGGAACCAAGCTCTTGTCCCAGCTACAGTGAGAXGCTGGCATTCACCCTGACTGCCTT
CCTAGAGCTCATGGACCATGGCATTGTCTCCTGGGACATGGTTTCAATCACCTTTATTAAGCAGATTGCAGGGTATGTGAGCCA 
GCCCATGGTGGACGTGTCAATCCTTCAGAGGTCCCTGGCCATCCTGGAGAGCATGGTCTTGAACAGCCAGAGTCTGTACCAGAA

GATAGCCGAGGAAATCACCGTGGGACAGCTCATCTCACACCTCCAGGTCTCCAACCAGGAGATTCAGACCTACGCCATTGCACT
GATTAATGCACTTTTTCTGAAGGCTCCTGAGGAC 

AATCCTGAATCATGTGATCCGAGGGAACCGCCCCATCAAAACTGAGATGGCCCATCAGCTATATGTCCTTCAAGTCCTAACCTT 
TAACCTTCTGGAAGAAAGGATGATGACCAAGATGGACCCCAATGACCAGGCTCAAAGGGACATCATATTTGAACTGAGGAGGAT 
TGCATTTGACGCAGAGTCTGATCCTAGCAATGCCCCTGGGAGTGGGACCGAAAAACGCAAAGCCATGTACACAAAGGACTACAA 
AATGCTGGGATTTACCAACCACATCAATCCAGCCATGGACTTTACCCAGACTCCTCCTGGAATGCTGGCGTTGGACAACATGCT
GTACTTGGCTAAAGTCCACCAGGACACCTACATCCGGATTGTCTTGGAGAACAGTAGCCGGGAAGACAAACATGAATGCCCCTT 
TGGCCGCAGTGCCATTGAGCTCACCAAAATGCTCTGTGAAATCCTGCAGGTTGGGGAACTACCAAATGAAGGACGCAATGACTA 
CCACCCGATGTTCTTTACCCATGACCGAGCCTTTGAAGAGCTCTTTGGAATCTGCATCCAGCTGTTGAACAAGACCTGGAAGGA 



